NONHUMAN PREGNANE X RECEPTOR SEQUENCES FOR USE IN
COMPARATIVE PHARMACOLOGY 

Field of the Invention 

The present invention relates to nonhuman pregnane X 
receptor (PXR) polynucleotide and polypeptide sequences 
isolated from fish, canines, porcine and primates and their 
use in comparative pharmacology. 

Background of the Invention

Members of the cytochrome P450 family of hemoproteins 
are critical in the oxidative metabolism of a wide variety of 
endogenous substances and xenobiotics, including various 
carcinogens and toxins (Nebert et al. (1987) Ann. Rev. 
Biochem. 56:945-993). In man, P450 3A4 monooxygenase, also 
referred to as CYP3A4 monooxygenase, plays a major role in the 
biotransformation of drugs due to its abundance in liver and 
intestine and its broad substrate specificity. P450 3A4
catalyzes the metabolism of >60% of all drugs that are in use,
including steroids, immunosuppressive agents, imidazole
antimycotics, and macrolide antibiotics (Maurel, P. in
Cytochrome P450: metabolic and toxicological aspects (ed.
Ioannides, C.) 241-270 (CRC Press, Inc. Boca Raton, FL 1996)).

Expression of P450 3A4 is induced both in vivo and in
primary hepatocytes in response to treatment with a variety of
compounds including, but not limited to, commonly used drugs
such as the glucocorticoid dexamethasone, the antibiotic
rifampicin, the antimycotic clotrimazole, and the
hypocholesterolemic agent lovastatin (Maurel, P. in Cytochrome
P450: metabolic and toxicological aspects (ed. Ioannides, C.)
241-270 (CRC Press, Inc. Boca Raton, FL 1996); Guzelian, P.S.
in Microsomes and Drug Oxidation (eds. Miners, J.O., Birkett,
D.J., Drew, R. & McManus, M.) 148-155 (Taylor and Francis,
London, 1988)). The inducibility of P450 3A4 expression
levels coupled with its broad substrate specificity represents
the basis for many drug interactions in patients undergoing
combination therapy. Thus, analysis of the effects of new 
compounds on P450 3A4 gene expression is an important aspect 
in drug development. 

While attempts have been made to develop in vivo and in 
vitro assays with which to profile the effects of compounds on 
P450 3A4 expression levels, poorly understood species-specific 
variations have limited the utility of using animals and their 
tissues for testing purposes. Thus, analysis of the effects 
of new compounds on P450 3A4 gene expression has been largely 
restricted to laborious assays involving human liver tissue. 

Nuclear hormone receptors comprise a large superfamily 
of ligand-modulated transcription factors that, in part, 
mediate responses to steroids, retinoids, and thyroid hormones 
(for review see Beato et al., (1995) Cell 83:851-857; Kastner 
et al., (1995) Cell 83:859-869; Mangelsdorf and Evans, (1995) 
Cell 83:841-850). Detailed analysis of the receptors, most 
notably the steroid class of receptors, has revealed multiple 
discrete functional modules within the family that display 
generalized functional characteristics (Tzukerman et al., 
(1994) Mol. Endocrinol. 8:21-30). A variable amino-terminal 
domain (A/B) is present that typically contains a strong and 
autonomous activation function (AF1) , shown to be critical for 
cell and target gene specificity (Tora et al., (1988) Nature 
333: 677-684). A more carboxyl-terminal central region 
contains a DNA binding domain (DBD) characterized by two C4- 
type zinc fingers. The DBD binds to specific genomic response 
elements and thereby regulates the transcriptional activity of 
select genes containing the response elements. At the distal 
carboxyl terminus, a ligand binding domain (LBD) is present 
containing a highly conserved second transactivation function 
(AF2) that is important for hormone-dependent transcriptional 
transactivation (Lanz and Rusconi, (1994) Endocrinology 135:
2183-2195). Sequences that function in nuclear localization,
receptor dimerization, and interaction with heat-shock
proteins (Gronemeyer and Laudet, (1995) CCQ 2:1173-1308) are
also present within the nuclear receptor substructure.
Through the coordinated action of these separate functional
domains, nuclear receptor activation by ligand culminates in
modulation of target gene expression (Tsai and O'Malley,
(1994) Ann. Rev. Biochem. 63: 451-486) and, in certain cases,
cross-talk with other cell signaling pathways such as the
NF-κB (Stein and Yang, (1995) Mol. Cell. Biol. 15: 4971-4979) and
AP-1 (Paech et al., (1998) Science 277: 1508-1510) pathways.
Ultimately, ligand alters nuclear receptor function by
altering the constellation of protein-protein interactions in
which the receptor is engaged (for review, see Freedman,
(1999) Cell 97: 5-8). The molecular details underlying the
multitude of cellular effects mediated by nuclear receptors 
are the subject of intense research activity. 

For the pregnane X receptor, referred to herein as PXR
(Unified Nomenclature Committee designation NR1I2), it has
been shown that PXR cell-based and binding assays are
predictive of in vivo effects on the cytochrome P450 3A4 gene
(Kliewer et al. (1998) Cell 92, 73-82; Lehmann et al. (1998)
J. Clin. Invest. 102, 1016-1023; Jones et al. (2000) Mol.
Endo. 14, 27-39).

WO 99/48915 discloses human PXR which binds to the CYP
promoter rifampicin/dexamethasone response element in
cytochrome P450 3A4. Also disclosed are nucleic acid
sequences encoding human PXR, as well as vectors and host
cells for expression of the human receptor, and methods for
using this receptor in vitro to screen compounds for their
ability to modulate P450 3A4 expression in humans.

However, nonhuman animal PXRs, particularly those of animals
well accepted for use in preclinical studies, would also be
useful in the development of in vitro and in vivo animal models
for profiling the effects of compounds on P450 3A4 expression
levels and in selecting preclinical models predictive of effects
in humans. Additionally, dissecting the broader
physiological role of PXR beyond cytochrome P450 gene
induction is facilitated by an understanding of which 
compounds activate PXR in animal models of interest. Thus, 
comparative pharmacology of PXR opens the possibility of 
extending the utility of PXR in drug development and toxicity 
assays as well as validating this receptor as a useful target 
in disease. 

Mouse PXR has been cloned and sequenced (Kliewer et al. 
(1998) Cell 92:73-82). Rat and rabbit PXR have also been 
cloned and sequenced (Jones et al. (2000) Molecular
Endocrinology 14(1):27-39; Zhang et al. (1999) Archives of
Biochemistry and Biophysics 368(1):14-22).

The present invention relates to PXRs for other nonhuman 
animals. 



Summary of the Invention 

Polynucleotide and polypeptide sequences for pregnane 
X receptors (PXR) isolated from fish, canines, porcine and 
primates are provided. The novel PXR sequences are useful as 
screening targets for the identification and development of 
selective PXR compounds. These agents are particularly 
useful in PXR comparative pharmacology and selecting 
appropriate animal models for preclinical studies predictive 
of effects in humans. 

Accordingly, the present invention provides isolated PXR
polypeptides comprising:

(a) an amino acid sequence of SEQ ID NO: 2, 4, 6 or 8; 

(b) a variant of an amino acid sequence as defined in 
(a) which modulates P450 3A4 levels or activity; or 

(c) a fragment of (a) or (b) which modulates P450 3A4 
levels or activity. 

According to another aspect of the invention there are
provided polynucleotides encoding polypeptides of the
invention, said polynucleotides comprising:

(a) a nucleic acid sequence of SEQ ID NO: 1, 3, 5, or 7;

(b) a nucleic acid sequence which hybridizes under
stringent conditions to the nucleic acid sequence as defined
in (a);

(c) a nucleic acid sequence that is degenerate as a
result of the genetic code to the nucleic acid sequences as
defined in (a) or (b); or

(d) a nucleic acid sequence having at least 60%
identity to the nucleic acid sequences as defined in (a), (b)
or (c).

Other related aspects of the present invention include 
expression vectors comprising polynucleotides of the invention 
which are capable of expressing a polypeptide of the 
invention, host cells comprising an expression vector of the 
invention, methods of producing a polypeptide of the invention 
which comprise maintaining a host cell of the invention under 
conditions suitable for obtaining expression of the
polypeptide and isolating said polypeptide, antibodies
specific for a polypeptide of the invention, and transgenic
nonhuman animals expressing a mutant PXR or a PXR from another
species.

The present invention also provides methods for 
identification of substances that modulate PXR activity and/or 
expression. In one embodiment, the method comprises 
contacting a polypeptide, polynucleotide, expression vector or 
host cell of the invention with a test substance and 
determining the effect of the test substance on the activity 
and/or expression of the polypeptide to determine whether the 
test substance modulates PXR activity and/or expression. In 
another embodiment, the test substance is administered to a 
nonhuman transgenic animal expressing a mutant PXR or a PXR 
from a different species and the effects of the test substance 
on expression and/or activity of the receptor are examined.

In addition, the present invention relates to compounds
which modulate PXR activity and which are identifiable by the
methods referred to above.

Brief Description of the Sequences

SEQ ID NO: 1 shows a nucleotide and amino acid sequence
of a ligand binding domain of a canine pregnane X receptor.

SEQ ID NO: 2 shows the amino acid sequence of the ligand
binding domain of the canine pregnane X receptor as depicted
in SEQ ID NO: 1.

SEQ ID NO: 3 shows a nucleotide and amino acid sequence of
a primate pregnane X receptor.

SEQ ID NO: 4 shows the amino acid sequence of the primate
pregnane X receptor as depicted in SEQ ID NO: 3.

SEQ ID NO: 5 shows a nucleotide and amino acid sequence of
a ligand binding domain of a porcine pregnane X receptor.

SEQ ID NO: 6 shows the amino acid sequence of the ligand
binding domain of the porcine pregnane X receptor as depicted
in SEQ ID NO: 5.

SEQ ID NO: 7 shows a nucleotide and amino acid sequence of
a ligand binding domain of a Zebrafish pregnane X receptor.

SEQ ID NO: 8 shows the amino acid sequence of the ligand
binding domain of the Zebrafish pregnane X receptor as
depicted in SEQ ID NO: 7.

Brief Description of the Figures 

Figure 1 is a bar graph showing the activation of a
primate PXR in the presence of various steroids and
xenobiotics.

Figure 2 is a bar graph showing the activation of a
canine PXR in the presence of various steroids and
xenobiotics.

Figure 3 is a bar graph showing the activation of a
porcine PXR in the presence of various steroids and
xenobiotics.

Figure 4 is a bar graph showing the activation of a fish
PXR in the presence of various steroids and xenobiotics.

Figure 5 is a bar graph showing the activation of a
primate PXR in the presence of various bile acids.

Figure 6 is a bar graph showing the activation of a
canine PXR in the presence of various bile acids.

Figure 7 is a bar graph showing the activation of a
porcine PXR in the presence of various bile acids.

Figure 8 is a bar graph showing the activation of a fish
PXR in the presence of various bile acids.

Detailed Description of the Invention 

Throughout the present specification and the accompanying
claims the words "comprise" and "include" and variations such
as "comprises", "comprising", "includes" and "including" are
to be interpreted inclusively. That is, these words are
intended to convey the possible inclusion of other elements or
integers not specifically recited, where the context allows.

The present invention relates to nonhuman animal
orthologs of pregnane X receptors, referred to herein as PXRs,
and variants thereof. More specifically, the present
invention relates to PXRs isolated from a canine, porcine,
primate or fish. Nucleotide sequence information for the
full length monkey PXR of the present invention is provided in
SEQ ID NO: 3. A polypeptide sequence of the monkey PXR is
also provided in SEQ ID NO: 4. Sequence information for the
ligand binding domains of the dog, pig and Zebrafish PXRs of
the present invention is provided in SEQ ID NO: 1, 5 and 7
(nucleotide and amino acid), respectively. Polypeptides of
the ligand binding domains of the dog, pig and Zebrafish PXRs
are also provided in SEQ ID NO: 2, 6, and 8.

Polypeptides of the invention consist essentially of the
amino acid sequences of SEQ ID NO: 2, 4, 6 or 8, a variant of
that sequence, or a fragment of either thereof. Polypeptides
of the invention may be in a substantially isolated form. It
will be understood that the polypeptide may be mixed with
carriers or diluents which will not interfere with the
intended purpose of the polypeptide and still be regarded as
substantially isolated. A polypeptide of the invention may
also be in a substantially purified form, in which case it
will generally comprise the polypeptide in a preparation in
which more than 50%, e.g. more than 80%, 90%, 95% or 99%, by
weight of the polypeptide in the preparation is a polypeptide
of the invention. Routine methods can be employed to purify
and/or synthesize the polypeptides according to the invention.
Such methods are well understood by persons skilled in the
art, and include techniques such as those disclosed in
Sambrook et al, Molecular Cloning: a Laboratory Manual, 2nd
Edition, CSH Laboratory Press, 1989, the disclosure of which
is included herein in its entirety by way of reference.

The term "variant" refers to a polypeptide that has the
same essential character or basic biological functionality as
the selected PXR. As demonstrated herein, biological
functionalities of nonhuman PXRs vary between species. By
"selected PXR", as used herein, it is meant a canine,
primate, porcine or fish PXR as described herein. Further,
for purposes of this invention a variant polypeptide is
preferably one which binds to the same ligand as one or more
of the nonhuman animal PXRs described herein. Preferably the
polypeptide modulates P450 3A4 expression in primates,
canines, porcine and/or fish. A polypeptide having the same
essential character as a selected PXR of the present invention
can be identified by monitoring for activation of the selected
PXR by an inducer of P450 3A4. A full-length variant
polypeptide is preferably one which includes the entire ligand
binding domain of the selected PXR.

In another aspect of the invention, a variant is one
which does not show the same activity as the selected PXR but
rather inhibits a basic function of the PXR. For example, a
variant polypeptide is one which inhibits activation of a
selected PXR upon exposure to an inducer of P450 3A4, for
example by binding to a PXR ligand to prevent activity
mediated by ligand binding to the selected PXR.

Typically, polypeptides with more than about 65% identity,
preferably at least 80%, at least 85% or at least 90% and
particularly preferably at least 95%, at least 97% or at least
99% identity, with the amino acid sequences of SEQ ID NO: 2,
4, 6 or 8 are considered as variants of the proteins.
Identity can be determined using a program such as a BLAST
sequence alignment program. Such variants may include allelic
variants and the deletion, modification or addition of single
amino acids or groups of amino acids within the protein
sequence, as long as the peptide maintains a basic biological
functionality of the selected PXR.
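By way of non-limiting illustration, the identity thresholds
recited above can be checked with a short routine. The sketch
below assumes the two sequences have already been aligned to
equal length (for example by a BLAST-type program) and counts
identical columns; the function names and the tier tuple are
illustrative assumptions, and an alignment program may compute
identity over a different denominator.

```python
# Identity tiers recited in the text: "more than about 65%", then
# "at least" 80, 85, 90, 95, 97 and 99 percent.
IDENTITY_TIERS = (65.0, 80.0, 85.0, 90.0, 95.0, 97.0, 99.0)


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns holding the same residue.

    Inputs must be pre-aligned to equal length, with '-' marking gaps.
    Columns that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def highest_tier_met(identity: float):
    """Return the highest recited threshold satisfied, or None.

    The 65% tier is strict ("more than"); the others are "at least".
    """
    met = [
        tier
        for tier in IDENTITY_TIERS
        if (identity > tier if tier == 65.0 else identity >= tier)
    ]
    return max(met) if met else None
```

A candidate scoring None under highest_tier_met would fall
outside the typical identity range given above; whether it is a
variant still depends on its biological functionality.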

Amino acid substitutions may be made, for example from 1,
2 or 3 to 10, 15, 20 or 30 substitutions. The modified
polypeptide generally retains the same activity as a selected
PXR. Conservative substitutions may be made, for example
according to the following Table 1. Amino acids in the same
block in the second column and preferably in the same line in
the third column may be substituted for each other.



TABLE 1

ALIPHATIC    Non-polar          G A P
                                I L V
             Polar-uncharged    C S T M
                                N Q
             Polar-charged      D E
                                K R
AROMATIC                        H F W Y
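For illustration only, the rule stated for Table 1 (same block
in the second column, preferably same line in the third column)
can be expressed as a lookup. The dictionary layout, the labels
and the function name below are assumptions made for this
sketch; the aromatic row has no second-column entry in Table 1
and is treated here as a single block.

```python
# Table 1 encoded as block -> lines of one-letter residue codes.
TABLE_1 = {
    "non-polar": ("GAP", "ILV"),
    "polar-uncharged": ("CSTM", "NQ"),
    "polar-charged": ("DE", "KR"),
    "aromatic": ("HFWY",),  # no second-column label in Table 1
}


def classify_substitution(original: str, replacement: str) -> str:
    """Classify a single-residue substitution against Table 1.

    Returns 'identical', 'same-line' (preferred), 'same-block'
    (permitted) or 'non-conservative'.
    """
    original, replacement = original.upper(), replacement.upper()
    if original == replacement:
        return "identical"
    for lines in TABLE_1.values():
        # Preferred case: both residues sit on the same third-column line.
        if any(original in line and replacement in line for line in lines):
            return "same-line"
        # Permitted case: both residues sit in the same second-column block.
        block = "".join(lines)
        if original in block and replacement in block:
            return "same-block"
    return "non-conservative"
```

For example, isoleucine to leucine is a same-line change,
whereas glycine to valine stays within the non-polar block but
crosses lines.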



Exemplary substitutions are shown in Table 2.

TABLE 2

Original Residue    Exemplary Substitutions
Ala                 Gly; Ser
Arg                 Lys
Asn                 Gln; His
Asp                 Glu
Cys                 Ser
Gln                 Asn
Glu                 Asp
Gly                 Ala
His                 Asn; Gln
Ile                 Leu; Val
Leu                 Ile; Val
Lys                 Arg
Met                 Met; Leu; Tyr
Ser                 Thr
Thr                 Ser
Trp                 Tyr
Tyr                 Trp; Phe
Val                 Ile; Leu



Shorter polypeptide sequences, also referred to herein as
"fragments", are within the scope of the invention. For
example, a peptide of at least 15, 20 or 30 amino acids or up
to 50, 60, 70, 80, 100, 150 or 200 amino acids in length is
considered to fall within the scope of the invention as long
as it demonstrates a basic biological functionality of a
selected PXR. In particular, but not exclusively, this aspect
of the invention encompasses the situation when the protein is
a fragment of the complete protein sequence and may represent,
particularly, the A/B domain (e.g., about amino acids 1-40 of
SEQ ID NO: 4), the DBD (e.g., about amino acids 41-105 of SEQ
ID NO: 4) or the LBD (e.g., about amino acids 106-434 of SEQ ID
NO: 4 or SEQ ID NO: 2, 6 or 8), alone or in combination. Such
fragments can be used to construct chimeric receptors,
preferably with another nuclear receptor, more preferably with
another member of the family of nuclear receptors involved in
P450 3A4 regulation such as the CAR nuclear receptor, or as
intermediates in the production of the full length sequences.
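As a minimal sketch, the approximate domain boundaries given
above for SEQ ID NO: 4 can be used to slice a full-length
sequence into fragments. The coordinates are the "about" values
from the text, read as 1-based and inclusive; the function name
and the placeholder sequence are hypothetical and do not
reproduce SEQ ID NO: 4.

```python
# Approximate 1-based, inclusive boundaries quoted in the text for SEQ ID NO: 4.
DOMAINS = {"A/B": (1, 40), "DBD": (41, 105), "LBD": (106, 434)}


def extract_domain(sequence: str, domain: str) -> str:
    """Return the residues of the named domain from a full-length sequence."""
    start, end = DOMAINS[domain]
    if len(sequence) < end:
        raise ValueError(f"sequence too short for {domain} ({start}-{end})")
    # Convert 1-based inclusive coordinates to a Python slice.
    return sequence[start - 1:end]


# Placeholder only: a synthetic 434-residue string, NOT the real SEQ ID NO: 4.
placeholder = "M" + "A" * 433
fragments = {name: extract_domain(placeholder, name) for name in DOMAINS}
```

Fragments obtained this way could, for instance, be joined to
sequence from another receptor when designing a chimeric
construct, subject to the functional tests described herein.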

Such fragments of PXR or a variant thereof can also be
used to raise anti-PXR antibodies. In this embodiment, the
fragment may comprise an epitope of a selected PXR polypeptide
and may otherwise not demonstrate the ligand binding or other
properties of the selected PXR.

Polypeptides of the invention may be chemically modified,
e.g. post-translationally modified. For example, they may be
glycosylated or comprise modified amino acid residues. They
may also be modified by the addition of histidine residues to
assist their purification or by the addition of a signal
sequence to promote insertion into the cell membrane. Such
modified polypeptides fall within the scope of the term
"polypeptide" of the invention.

The invention also includes nucleotide sequences that
encode canine, primate, porcine or fish PXR or a variant
thereof, as well as nucleotide sequences which are
complementary thereto. The nucleotide sequence may be RNA or
DNA including genomic DNA, synthetic DNA or cDNA. Preferably
the nucleotide sequence is a DNA sequence, and most preferably
a cDNA sequence. Nucleotide sequence information for the
canine, primate, porcine and Zebrafish PXRs of the present
invention is provided in SEQ ID NO: 1, 3, 5 and 7,
respectively. Such nucleotides can be isolated from cells of
the selected species, namely, canine, primate, porcine or
Zebrafish, or synthesized according to methods well known in
the art, as described by way of example in Sambrook et al,
1989.

Typically a polynucleotide of the invention comprises a 
contiguous sequence of nucleotides which is capable of 
hybridizing under selective conditions to the coding sequence 
or the complement of the coding sequence of any of SEQ ID NO:
1, 3, 5 or 7.

The polynucleotide can be in single-, double- or triple- 
stranded form. In addition, polynucleotide, as used herein, 
can refer to triple-stranded regions comprising RNA or DNA or 
both. The strands in such regions may be from the same 
molecule or from different molecules. As used herein the term 
polynucleotide includes nucleic acids that contain one or more 
modified (e.g., tritylated) or unusual (e.g., inosine) bases. 

Thus, DNAs or RNAs with backbones modified for stability or 
for other reasons are polynucleotides as that term is intended 
herein. 

A polynucleotide of the invention can hybridize to the 
coding sequence or the complement of the coding sequence of 
SEQ ID NO: 1, 3, 5 or 7 at a level significantly above 
background. Background hybridization may occur, for example, 
because of other cDNAs present in a cDNA library. The signal 
level generated by the interaction between a polynucleotide of 
the invention and the coding sequence or complement of the
coding sequence of SEQ ID NO: 1, 3, 5 or 7 is typically at
least 10-fold, preferably at least 100-fold, as intense as
interactions between other polynucleotides and the coding
sequence of SEQ ID NO: 1, 3, 5 or 7. The intensity of
interaction may be measured, for example, by radiolabeling the
probe, e.g. with 32P. Selective hybridization may typically
be achieved using conditions of medium to high stringency.
However, such hybridization may be carried out under any
suitable conditions known in the art (see Sambrook et al,
1989). For example, if high stringency is required, suitable
conditions include from 0.1 to 0.2 x SSC at 60°C up to 65°C.
If lower stringency is required, suitable conditions include
2 x SSC at 60°C.
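To make the stringency conditions above concrete, the following
sketch records them as data and converts an SSC strength into
an approximate sodium ion concentration. It assumes the
conventional composition of 1 x SSC (0.15 M sodium chloride and
0.015 M trisodium citrate), which is standard laboratory
practice but is not stated in this specification; all names are
illustrative.

```python
# Assumed composition of 1 x SSC (conventional; not recited in the text).
NACL_MOLAR_AT_1X = 0.150
TRISODIUM_CITRATE_MOLAR_AT_1X = 0.015

# Conditions as given in the text: (min, max) SSC strength and temperature.
STRINGENCY = {
    "high": {"ssc": (0.1, 0.2), "temp_c": (60, 65)},
    "lower": {"ssc": (2.0, 2.0), "temp_c": (60, 60)},
}


def sodium_molarity(ssc_strength: float) -> float:
    """Approximate Na+ molarity of an SSC solution of the given strength.

    Each trisodium citrate contributes three sodium ions.
    """
    per_1x = NACL_MOLAR_AT_1X + 3 * TRISODIUM_CITRATE_MOLAR_AT_1X
    return ssc_strength * per_1x


def sodium_range(level: str):
    """Return the (min, max) Na+ molarity for a named stringency level."""
    low, high = STRINGENCY[level]["ssc"]
    return sodium_molarity(low), sodium_molarity(high)
```

Under these assumptions the high stringency washes correspond
to roughly a ten- to twenty-fold lower sodium concentration
than the lower stringency wash at 2 x SSC.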

The coding sequence of SEQ ID NO: 1, 3, 5 or 7 may be
modified by nucleotide substitutions, for example from 1, 2 or
3 to 10, 15, 25, 50 or 100 substitutions. The polynucleotide
of SEQ ID NO: 1, 3, 5 or 7 may alternatively or additionally
be modified by one or more insertions and/or deletions and/or
by an extension at either or both ends. A polynucleotide may
include one or more introns. For example, in one embodiment,
the polynucleotide may comprise genomic DNA. Additional
sequences such as signal sequences which may assist in
insertion of the polypeptide in a cell membrane may also be
included. The modified polynucleotide generally encodes a
polypeptide which has the same activity as a selected PXR.
Alternatively, a polynucleotide encodes a ligand-binding
portion of a polypeptide or a polypeptide which inhibits an
activity of a selected PXR. Degenerate substitutions may be
made and/or substitutions may be made which would result in a
conservative amino acid substitution when the modified
sequence is translated, for example as shown in Table 1 or 2
above.
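A degenerate substitution, as referred to above, changes the
nucleotide sequence without changing the encoded amino acids.
The sketch below illustrates one way such a change might be
recognized, using the standard genetic code; the function names
are assumptions for illustration and the example codons are
arbitrary, not taken from the sequence listing.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position
# (first base varies slowest, third base fastest); '*' marks a stop codon.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    dna = dna.upper()
    if len(dna) % 3:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def is_degenerate_change(original: str, modified: str) -> bool:
    """True if the sequences differ but encode the same polypeptide."""
    return (original.upper() != modified.upper()
            and translate(original) == translate(modified))
```

A change that alters the translation could then be checked
against Table 1 or Table 2 to see whether the resulting amino
acid substitution is conservative.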

A nucleotide sequence which is capable of selectively
hybridizing to the complement of the DNA coding sequence of
SEQ ID NO: 1, 3, 5, or 7 will generally have at least 60%, at
least 70%, at least 80%, at least 90%, at least 95%, at least
98% or at least 99% sequence identity to the coding sequence
of SEQ ID NO: 1, 3, 5 or 7 over a region of at least 20,
preferably at least 30, for instance at least 40, at least 60,
more preferably at least 100 contiguous nucleotides, or most
preferably over the full length of SEQ ID NO: 1, 3, 5 or 7.
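The requirement of a given identity over a region of contiguous
nucleotides can be illustrated with a sliding window. The
sketch below assumes the two sequences are already aligned
position by position and finds the best-scoring window of the
stated length; the function names and default values (60% over
20 nucleotides, the least stringent combination above) are
chosen for illustration.

```python
def best_window_identity(aligned_a: str, aligned_b: str, window: int) -> float:
    """Highest percent identity over any window of `window` aligned columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if window <= 0 or window > len(aligned_a):
        raise ValueError("window must be between 1 and the alignment length")
    # 1 where the column is an identical, non-gap pair; 0 otherwise.
    matches = [
        int(a == b and a != "-")
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
    ]
    current = sum(matches[:window])
    best = current
    # Slide the window one column at a time, updating the running count.
    for i in range(window, len(matches)):
        current += matches[i] - matches[i - window]
        best = max(best, current)
    return 100.0 * best / window


def meets_identity_over_region(aligned_a: str, aligned_b: str,
                               min_identity: float = 60.0,
                               min_length: int = 20) -> bool:
    """True if some window of min_length columns reaches min_identity percent."""
    return best_window_identity(aligned_a, aligned_b, min_length) >= min_identity
```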

For example, the UWGCG Package provides the BESTFIT
program which can be used to calculate homology (for example
used on its default settings) (Devereux et al., (1984) Nucleic
Acids Res. 12: 387-395). The PILEUP and BLAST algorithms can
be used to calculate homology or line up sequences (typically
on their default settings), for example as described in
Altschul (1993) J. Mol. Evol. 36: 290-300; Altschul et al
(1990) J. Mol. Biol. 215: 403-410.

Software for performing BLAST analyses is publicly
available through the National Center for Biotechnology
Information (http://www.ncbi.nlm.nih.gov/). This algorithm
involves first identifying high scoring sequence pairs (HSPs)
by identifying short words of length W in the query sequence
that either match or satisfy some positive-valued threshold
score T when aligned with a word of the same length in a
database sequence. T is referred to as the neighborhood word
score threshold (Altschul et al. 1990). These initial
neighborhood word hits act as seeds for initiating searches to
find HSPs containing them. The word hits are extended in both
directions along each sequence for as far as the cumulative
alignment score can be increased. Extensions for the word
hits in each direction are halted when: the cumulative
alignment score falls off by the quantity X from its maximum
achieved value; the cumulative score goes to zero or below,
due to the accumulation of one or more negative-scoring
residue alignments; or the end of either sequence is reached.
The BLAST algorithm parameters W, T and X determine the
sensitivity and speed of the alignment. The BLAST program
uses as defaults a word length (W) of 11, the BLOSUM62 scoring
matrix (see Henikoff and Henikoff (1992) Proc. Natl. Acad.
Sci. USA 89: 10915-10919), alignments (B) of 50, expectation
(E) of 10, M=5, N=4, and a comparison of both strands.
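The seed-and-extend procedure described above can be sketched
in simplified form. The code below is an illustration only and
is not the BLAST program: it uses exact word matches as seeds
(no neighborhood threshold T), performs ungapped extension, and
reads M=5 and N=4 as a match reward of 5 and a mismatch penalty
of 4. The function names and the x_drop default are assumptions.

```python
def find_seeds(query: str, subject: str, w: int):
    """Yield (query_pos, subject_pos) for every exact word match of length w."""
    index = {}
    for j in range(len(subject) - w + 1):
        index.setdefault(subject[j:j + w], []).append(j)
    for i in range(len(query) - w + 1):
        for j in index.get(query[i:i + w], []):
            yield i, j


def _extend(query, subject, qi, si, step, start_score, match, mismatch, x_drop):
    """Extend in one direction; return (score gain, columns kept at best score)."""
    score = best = start_score
    length = best_len = 0
    while 0 <= qi < len(query) and 0 <= si < len(subject):
        score += match if query[qi] == subject[si] else mismatch
        length += 1
        if score > best:
            best, best_len = score, length
        # Halt when the score drops X below its maximum or reaches zero.
        if best - score >= x_drop or score <= 0:
            break
        qi += step
        si += step
    return best - start_score, best_len


def hsp_from_seed(query, subject, qi, si, w, match=5, mismatch=-4, x_drop=20):
    """Grow a seed into an ungapped HSP: (score, query_start, subject_start, length)."""
    seed_score = w * match
    right_gain, right_len = _extend(
        query, subject, qi + w, si + w, +1, seed_score, match, mismatch, x_drop)
    left_gain, left_len = _extend(
        query, subject, qi - 1, si - 1, -1, seed_score + right_gain,
        match, mismatch, x_drop)
    score = seed_score + right_gain + left_gain
    return score, qi - left_len, si - left_len, left_len + w + right_len
```

In this sketch a larger word length w yields fewer seeds and a
faster but less sensitive search, which mirrors the role of W
described above.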

The BLAST algorithm performs a statistical analysis of
the similarity between two sequences; see e.g., Karlin and
Altschul (1993) Proc. Natl. Acad. Sci. USA 90: 5873-5877. One
measure of similarity provided by the BLAST algorithm is the
smallest sum probability (P(N)), which provides an indication
of the probability by which a match between two nucleotide or
amino acid sequences would occur by chance. For example, a
sequence is considered similar to another sequence if the
smallest sum probability in comparison of the first sequence
to the second sequence is less than about 1, preferably less
than about 0.1, more preferably less than about 0.01, and most
preferably less than about 0.001.
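As a brief illustration, the smallest sum probability cut-offs
above can be applied to a reported P(N) value as follows. The
tier labels and the function name are assumptions for this
sketch; the cut-off values are those recited in the text.

```python
# Cut-offs from the text, most stringent first, with illustrative labels.
P_TIERS = (
    (0.001, "most preferred"),
    (0.01, "more preferred"),
    (0.1, "preferred"),
    (1.0, "similar"),
)


def similarity_tier(smallest_sum_probability: float) -> str:
    """Return the most stringent tier whose cut-off the P(N) value falls below."""
    for cutoff, label in P_TIERS:
        if smallest_sum_probability < cutoff:
            return label
    return "not similar"
```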

Any combination of the above mentioned degrees of
sequence identity and minimum sizes may be used to define
polynucleotides of the invention, with the more stringent
combinations (i.e. higher sequence identity over longer
lengths) being preferred. Thus, for example a polynucleotide
which has at least 90% sequence identity over 25, preferably
over 30 nucleotides forms one aspect of the invention, as does
a polynucleotide which has at least 95% sequence identity over
40 nucleotides.

The nucleotides according to the invention have utility
in production of the polypeptides according to the invention.
Production may take place in vitro, in vivo or ex vivo. The
nucleotides may be involved in recombinant protein synthesis,
as therapeutic agents in their own right, utilized in gene
therapy techniques, and/or utilized in the production of
nonhuman transgenic animals. Nucleotides complementary to
those encoding PXR, or antisense sequences, may also be used
therapeutically.
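By way of illustration, a sequence complementary to a coding
strand can be derived computationally as its reverse
complement. The sketch below handles only the four standard
DNA bases; the function names are assumptions and the example
input is arbitrary rather than drawn from the sequence listing.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence, read 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]


def antisense_rna(coding_dna: str) -> str:
    """Return the RNA sequence complementary to the given coding strand."""
    return reverse_complement(coding_dna).replace("T", "U").replace("t", "u")
```

For the arbitrary input "ATGGCC", reverse_complement gives
"GGCCAT" and antisense_rna gives "GGCCAU".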

Polynucleotides of the invention may be used as primers,
e.g. PCR primers or primers for an alternative amplification
reaction, or as probes, e.g. polynucleotides detectably
labeled by conventional means using radioactive or
non-radioactive labels. In addition, the polynucleotides may be
cloned into vectors.

Such primers, probes and other fragments will preferably
be at least 10, preferably at least 15 or at least 20, for
example at least 25, at least 30 or at least 40 nucleotides in
length. They will typically be up to 40, 50, 60, 70, 100 or
150 nucleotides in length. Probes and fragments can be longer
than 150 nucleotides in length, for example up to 200, 300,
400, 500, 600, 700 nucleotides in length, or even up to a few
nucleotides, such as five or ten nucleotides, short of the
coding sequence of SEQ ID NO: 1, 3, 5 or 7.

The polynucleotides of the present invention are also
useful in the production of chimeric receptors or fusion
proteins having a PXR component which comprises at least a DNA
binding domain or a ligand binding domain of a canine,
primate, porcine or fish PXR and a non-PXR derived sequence.
Non-PXR derived sequences can be selected so as to be
suitable for the purpose to be served by the chimeric
receptor. Examples of such sequences include, but are not
limited to, glutathione-S-transferase, the DNA binding domain
of yeast transcription factor GAL4 and other DNA binding
domains such as the DNA binding domains for estrogen or
glucocorticoid receptors, and the viral VP16 transcriptional
activation domain. Chimeric receptors of the present
invention may further comprise a detectable label such as a
radioactive or fluorescent label. The chimeric receptors may
also be bound to a solid support such as glass or plastic
particles or plates or a filter.

The present invention also includes expression vectors
that comprise nucleotide sequences encoding the polypeptides
or variants thereof of the invention. Such expression vectors
are routinely constructed in the art of molecular biology and
may for example involve the use of plasmid DNA and appropriate
initiators, promoters, enhancers and other elements, such as
for example polyadenylation signals which may be necessary,
and which are positioned in the correct orientation, in order
to allow for protein expression. Other suitable vectors would
be apparent to persons skilled in the art based upon teachings
provided herein and what is known in the art. By way of
further example in this regard we refer to Sambrook et al.
1989.

Polynucleotides according to the invention may also be
inserted into the vectors described above in an antisense
orientation in order to provide for the production of
antisense RNA. Antisense RNA or other antisense
polynucleotides may also be produced by synthetic means. Such
antisense polynucleotides may be used as test compounds in the
assays of the invention or may be useful therapeutically.

Preferably, a polynucleotide of the invention used in a 
vector is operably linked to a control sequence which is 
capable of providing for the expression of the coding sequence 
by the host cell, i.e. the vector is an expression vector. 
The term "operably linked" refers to a juxtaposition wherein 
the components described are in a relationship permitting them 
to function in their intended manner. For example, a 
regulatory sequence, such as a promoter, "operably linked" to 
a coding sequence is positioned in such a way that expression 
of the coding sequence is achieved under conditions compatible 
with the regulatory sequence. 

The vectors may be for example, plasmid, virus or phage 
vectors provided with an origin of replication, optionally a 
promoter for the expression of the polynucleotide and 
optionally a regulator of the promoter. The vectors may 
contain one or more selectable marker genes, for example an 
ampicillin resistance gene in the case of a bacterial plasmid 
or a resistance gene for a fungal vector. Vectors may be used 
in vitro, for example, for the production of DNA or RNA or 
used to transfect or transform a host cell, for example, a 
mammalian host cell. The vectors may also be adapted to be 
used in vivo, for example in a method of gene therapy or in 
the production of nonhuman transgenic animals, preferably 
mice. Additional vector components known in the art are 
suitable for use in the vectors of the present invention and 
include, for example, processing sites such as a 
polyadenylation signal, ribosome binding sites, RNA splice 
sites, and transcriptional termination sequences. 

Promoters and other expression regulation signals may be
selected to be compatible with the host cell for which
expression is designed. Examples of yeast promoters which can
be used in the present invention include S. cerevisiae GAL4
and ADH promoters, and S. pombe NMT1 and ADH promoters. Viral
promoters can also be used. Examples of viral promoters
include, but are not limited to, the Moloney murine leukemia
virus long terminal repeat (MMLV LTR), the Rous sarcoma virus
(RSV) LTR promoter, the SV40 promoter, the human
cytomegalovirus (CMV) IE promoter, adenovirus promoters, HSV
promoters (such as the HSV IE promoters), and HPV promoters,
particularly the HPV upstream regulatory region (URR).
Mammalian promoters useful in the present invention include
the metallothionein promoter, which can be induced in response
to heavy metals such as cadmium, and β-actin promoters.
Tissue-specific promoters are especially preferred. All these
promoters, as well as additional promoters useful in the
present invention, are readily available in the art.

The vector may further include sequences flanking the
polynucleotide giving rise to polynucleotides which comprise
sequences homologous to eukaryotic genomic sequences,
preferably mammalian genomic sequences or viral genomic
sequences. This allows the introduction of the
polynucleotides of the invention into the genome of eukaryotic
cells or viruses by homologous recombination. In particular,
a plasmid vector comprising the expression cassette flanked by
viral sequences can be used to prepare a viral vector suitable
for delivering the polynucleotides of the invention to a
mammalian cell. Other examples of suitable viral vectors
include herpes simplex viral vectors and retroviruses,
including lentiviruses, adenoviruses, adeno-associated viruses
and HPV viruses. Gene transfer techniques using these viruses
are known to those skilled in the art. Retrovirus vectors, for
example, may be used to stably integrate the polynucleotide
into the host genome. Replication-defective adenovirus
vectors by contrast remain episomal and therefore allow
transient expression.

WO 02/094865 PCT/US02/16445

The invention also includes cells that have been modified
to express the PXR polypeptide or a variant thereof. Such
cells include transient or, preferably, stable higher
eukaryotic cell lines such as mammalian cells or insect cells,
lower eukaryotic cells such as yeast, or prokaryotic cells
such as bacterial cells. Examples of cells which may be
modified by insertion of vectors encoding a polypeptide
according to the invention include, but are not limited to,
mammalian HEK293T, CHO, HeLa and COS cells.

A polypeptide of the invention may also be expressed in
cells of a transgenic non-human animal, preferably a mouse.
Accordingly, transgenic non-human animals expressing a PXR
polypeptide of the invention are also included within the
scope of the invention. For example, transgenic mice can be
generated that express a selected PXR of the present invention
as well as the endogenous mouse PXR gene. Mice can also be
generated in which the endogenous PXR gene is knocked out and
then replaced by the selected PXR polynucleotide of the
present invention. Transgenic animals can also be generated
that express isoforms of a selected PXR as well as mutant
alleles of the PXRs of the present invention. Transgenic
animals developed by these methods can be used to screen
compounds for drug interactions and toxicities and to study
the regulation of P450 3A4 in vivo.

According to another aspect, the present invention also
relates to antibodies specific for a polypeptide of the
invention. Such antibodies are, for example, useful in
purification, isolation or screening methods involving
immunoprecipitation techniques or, indeed, as therapeutic
agents in their own right.

Antibodies can be raised against specific epitopes of the
polypeptides according to the invention. Such antibodies may
be used to block ligand binding to the receptor. An antibody,
or other compound, "specifically binds" to a protein when it
binds with preferential or high affinity to the protein or
polypeptide for which it is specific but does not
substantially bind or binds with only low affinity to other
proteins. A variety of protocols for competitive binding or
immunoradiometric assays to determine the specific binding
capability of an antibody are well known in the art (see for
example Maddox et al. (1993) J. Exp. Med. 158:1211-1226).
Such immunoassays typically involve the formation of complexes
between the specific protein and its antibody and the
measurement of complex formation.

Antibodies of the invention may be antibodies to the
canine, primate, porcine or fish polypeptides or fragments
thereof. For the purposes of this invention, the term
"antibody", unless specified to the contrary, includes
fragments which bind a polypeptide of the invention. Such
fragments include Fv, F(ab') and F(ab')2 fragments, as well as
single chain antibodies. Furthermore, the antibodies and
fragments thereof may be chimeric antibodies, CDR-grafted
antibodies or humanized antibodies.

Antibodies may be used in methods for detecting
polypeptides of the invention in a biological sample. In
these methods, an antibody of the invention is first provided.
A biological sample is then incubated with the antibody under
conditions which allow for the formation of a complex between
the antibody and the polypeptide or antigen, and the amount of
antibody-polypeptide complex formed is determined. Various
methods for determining formation of an antibody-antigen
complex are well known to those of skill in the art.

For purposes of the present invention, the term "biological
sample" is meant to include, but is not limited to, tissue
extracts, blood, serum, saliva, urine, cerebrospinal fluid,
and bile.

Antibodies of the invention may be bound to a solid
support and/or packaged into kits in a suitable container
along with suitable reagents, controls, instructions, etc.
Antibodies may be linked to a revealing label and thus may be
suitable for use in methods of in vivo PXR imaging.

Antibodies of the invention can be produced by any
suitable method. Means for preparing and characterizing
antibodies are well known in the art, see for example Harlow
and Lane (1988) "Antibodies: A Laboratory Manual", Cold Spring
Harbor Laboratory Press, Cold Spring Harbor, NY. For example,
an antibody may be produced by raising antibody in a host
animal against the whole polypeptide or a fragment thereof,
for example an antigenic epitope thereof, hereinafter the
"immunogen".

A method for producing a polyclonal antibody comprises
immunizing a suitable host animal, for example an experimental
animal, with the immunogen and isolating immunoglobulins from
the animal's serum. In this method, the animal is inoculated
with the immunogen. Blood is subsequently removed from the
animal and the IgG fraction purified.

A method for producing a monoclonal antibody comprises
immortalizing cells which produce the desired antibody.
Hybridoma cells can be produced by fusing spleen cells from an
inoculated experimental animal with tumor cells (Kohler and
Milstein (1975) Nature 256:495-497). An immortalized cell
producing the desired antibody may be selected by a
conventional procedure. Hybridomas are then grown in culture
or injected intraperitoneally for formation of ascites fluid
or into the blood stream of an allogenic host or
immunocompromised host.

For the production of both monoclonal and polyclonal
antibodies, the experimental animal is suitably a goat,
rabbit, rat or mouse. If desired, the immunogen may be
administered as a conjugate in which the immunogen is coupled,
for example via a side chain of one of the amino acid
residues, to a suitable carrier. The carrier molecule is
typically a physiologically acceptable carrier. The antibody
obtained may be isolated and, if desired, purified.

A further aspect of the present invention relates to in
vitro (cell-free) and in vivo (cell-based) assays that can be
used to profile the effects of compounds, particularly
potential new drugs, on P450 3A4 levels in various species.
These assays can take any of a variety of forms. Since
compounds that activate PXR function are inducers of P450 3A4
gene expression, binding and activation assays using selected
PXRs of the present invention provide efficient means to
identify compounds expected to activate P450 3A4 in a selected
species.

Binding assays of the present invention include cell-free
assays in which a selected PXR, or the ligand binding domain
of a selected PXR (alone or present as a fusion protein), is
incubated with a test compound which, advantageously, bears a
detectable label (e.g., a radioactive or fluorescent label).
The selected PXR, or ligand binding domain thereof, free or
bound to the test compound, is then separated from free test
compound using any of a variety of techniques (e.g., gel
filtration chromatography, for example on Sephadex G50 spin
columns, or capture on a hydroxyapatite resin). The
amount of test compound bound to the selected PXR, or ligand
binding domain thereof, is then determined via detection of
the label.

An alternative approach for detecting radiolabeled test
compound bound to a selected PXR, or ligand binding domain
thereof, is a scintillation proximity assay (SPA). In this
assay, a bead (or other particle) is impregnated with
scintillant and coated with a molecule that can capture the
selected PXR, or ligand binding domain thereof (e.g.,
streptavidin-coated beads can be used to capture biotinylated
PXR ligand binding domain). Radioactive counts are detected
only when the complex of radiolabeled test compound and the
selected PXR, or ligand binding domain thereof, is captured on
the surface of the SPA bead, bringing the radioactive label
into sufficient proximity to the scintillant to emit a signal.
This approach has the advantage of not requiring the
separation of free test compound from bound (Nichols et al.,
Anal. Biochem. 257:112-119 (1998)).

Assays to determine whether a test compound interacts
with a selected PXR ligand binding domain can also be
performed via a competition binding assay. In this assay, the
selected PXR, or ligand binding domain thereof, is incubated
with a compound known to interact with the selected PXR, which
compound, advantageously, bears a detectable label (e.g., a
radioactive or fluorescent label). A test compound is added
to the reaction and assayed for its ability to compete with
the labeled compound for binding to the selected PXR, or
ligand binding domain thereof. A standard assay format
employing a step to separate free known (labeled) compound
from bound, or an SPA format, can be used to assess the
ability of the test compound to compete.
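
As a rough illustration of how data from a competition binding assay of this kind might be reduced, the following Python sketch converts the bound-label signal into percent displacement and estimates an IC50 by interpolation on a log concentration scale. The function names, the correction for nonspecific binding and all numerical values are assumptions made for illustration only and are not taken from the assays described herein.

```python
import math


def percent_displacement(signal, total, nonspecific):
    """Percent of specifically bound labeled compound displaced by the test compound.

    total       -- signal with labeled compound alone (hypothetical control)
    nonspecific -- signal remaining with excess unlabeled ligand (hypothetical control)
    """
    window = total - nonspecific
    if window <= 0:
        raise ValueError("total binding must exceed nonspecific binding")
    return 100.0 * (total - signal) / window


def estimate_ic50(concs, displacements):
    """Interpolate the concentration giving 50% displacement on a log10 scale.

    concs must be ascending and positive; returns None if 50% is not bracketed.
    """
    points = list(zip(concs, displacements))
    for (c_lo, d_lo), (c_hi, d_hi) in zip(points, points[1:]):
        if d_lo <= 50.0 <= d_hi and d_hi != d_lo:
            frac = (50.0 - d_lo) / (d_hi - d_lo)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    return None


# Hypothetical counts for one test compound at four concentrations (in uM).
total, nonspecific = 10000.0, 1000.0
concs_uM = [0.1, 1.0, 10.0, 100.0]
signals = [9550.0, 7750.0, 3250.0, 1450.0]
disp = [percent_displacement(s, total, nonspecific) for s in signals]
ic50_uM = estimate_ic50(concs_uM, disp)
```

A curve fit over the full concentration series would normally be preferred to two-point interpolation; the interpolation is used here only to keep the sketch short.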

To determine if a test compound activates a selected PXR,
and thus induces P450 3A4 expression, the ligand binding
domain of the selected PXR is prepared (e.g., expressed) as a
fusion protein (e.g., with glutathione-S-transferase (GST), a
histidine tag or a maltose binding protein). The fusion
protein and coactivator (either or both advantageously labeled
with a detectable label, e.g., a radiolabel or fluorescent
tag) are incubated in the presence and absence of the test
compound and the extent of binding of the coactivator to the
fusion protein determined. The induction of interaction in
the presence of the test compound is indicative of an
activator of the selected PXR.

PXR activation assays in accordance with the invention
can be carried out using a full length PXR and a reporter
system comprising one or more copies of the DNA binding site
recognized by the PXR binding domain. More preferably,
however, the activation assays are conducted using established
chimeric receptor systems. For example, the ligand binding
domain of a selected PXR can be fused to the DNA binding
domain of, for example, yeast transcription factor GAL4, or
that of the estrogen or glucocorticoid receptor. An
expression vector for the chimera (e.g., a GAL4-PXR chimera)
can be transfected into host cells (e.g., CV-1, HuH7, HepG2 or
Caco2 cells) together with a reporter construct. The reporter
construct may comprise one or more (e.g., 5) copies of the DNA
binding site recognized by the binding domain present in the
chimera (e.g., the GAL4 DNA binding site) driving expression
of a reporter gene (e.g., CAT, SPAP or luciferase). Cells
containing the constructs are then treated with either vehicle
alone or vehicle containing test compound, and the level of
expression of the reporter gene determined. In accordance
with this assay, enhancement of expression of the reporter
gene in the presence of the test compound indicates that the
test compound activates the selected PXR and thus can function
as an inducer of CYP3A4 gene expression in that species.

Another format suitable for use in connection with the
present invention is the yeast two-hybrid assay. This is an
established approach to detect protein-protein interactions
that is performed in yeast. Protein #1, representing the
bait, is expressed in yeast as a chimera with a DNA binding
domain (e.g., GAL4). Protein #2, representing the predator,
is expressed in the same yeast cell as a chimera with a strong
transcriptional activation domain. The interaction of bait
and predator results in the activation of a reporter gene
(e.g., luciferase or β-galactosidase) or the regulation of a
selectable marker (e.g., LEU2 gene). This approach can be
used as a screen to detect, for example, ligand-dependent
interactions between a selected PXR and other proteins such as
coactivator proteins (e.g., SRC1, TIF1, TIF2, ACTR) or
fragments thereof (Fields et al., Nature 340:245-246 (1989)).

Still another format is the ligand-induced complex
formation (LIC) assay. This assay detects ligand-mediated
effects on nuclear receptor-DNA interactions. A selected PXR
(or, minimally, DNA and/or ligand binding domains thereof) can
be incubated with its heterodimeric partner RXR in the
presence of DNA representing an established PXR/RXR binding
site. Test compounds can be assayed for their ability to
either enhance or interfere with binding of the PXR/RXR
heterodimer to DNA (Forman et al., Proc. Natl. Acad. Sci. USA
94:4312-4317 (1997)).

In a preferred embodiment, the screening assay takes the
form of a FRET (fluorescence resonance energy transfer) assay
(Nichols et al. (1998) Anal. Biochem. 257:112-119). This
screening assay comprises the steps of exposing a sample
portion comprising the donor located at a first position and
the acceptor located at a second position to light at a
first wavelength capable of inducing a first electronic
transition in the donor. The donor comprises a complex of a
lanthanide chelate and a lanthanide capable of binding the
chelate. The spectral overlap of the donor emission and
acceptor absorption is sufficient to enable energy transfer
from the donor to the acceptor as measured by a detectable
increase in acceptor luminescence. In a preferred embodiment,
a SRC-1 (LCD2, 677-696) lanthanide chelate is used.
Preferably, the lanthanide element comprises Europium and the
signal chelate comprises Europium bound to a PXR of the
present invention. A signal pair comprising Europium bound to
the PXR and APC (allophycocyanin) bound to SRC-1 (see, e.g.,
Parks et al. (1999) Science 284:1365-1368) can also be used.

Suitable test compounds which can be screened in the
above assays include combinatorial libraries, defined chemical
entities and compounds, peptides and peptide mimetics,
oligonucleotides, natural product libraries such as display
libraries (e.g., phage display libraries), and antibody
products.

Typically, organic molecules, preferably small organic
molecules which have a molecular weight of from 50 to 2500
daltons, are screened. Candidate test compounds can be
biomolecules including saccharides, fatty acids, steroids,
purines, pyrimidines, derivatives, structural analogs and
combinations thereof. Such test compounds are obtained from
a wide variety of sources including libraries of synthetic and
natural compounds. Further, known pharmacological agents can
be subjected to directed or random chemical modifications,
such as acylation, alkylation, esterification, amidification,
etc. to produce structural analogs.

Test compounds can be used in an initial screen of, for
example, 10 compounds per reaction, and the compounds of these
batches which show inhibition or activation re-screened
individually. Test compounds may be screened at a
concentration of from 1 nM to 1000 µM, preferably from 1 µM to
100 µM, more preferably from 1 µM to 10 µM. Preferably, the
activity of a test compound is compared to the activity shown
by a known activator or inhibitor. A test compound which acts
as an inhibitor preferably produces a 50% inhibition of
activity of the receptor. Alternatively a test compound which
acts as an activator preferably produces 50% of the maximal
activity produced using a known activator.
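
The pooling step and the 50% criteria described above can be sketched in Python as follows. This is a minimal illustration only: the function names, the vehicle-corrected definitions of percent activation and percent inhibition, and the example readings are assumptions and are not prescribed by the assays described herein.

```python
def make_pools(compounds, pool_size=10):
    """Split a compound list into batches for the initial screen (10 per reaction by default)."""
    return [compounds[i:i + pool_size] for i in range(0, len(compounds), pool_size)]


def percent_of_max_activation(test, vehicle, reference):
    """Test-compound response as a percentage of the response to a known activator.

    All three readings are reporter signals; the vehicle reading is subtracted
    from both numerator and denominator (an assumed convention).
    """
    window = reference - vehicle
    if window <= 0:
        raise ValueError("reference activator must exceed vehicle")
    return 100.0 * (test - vehicle) / window


def percent_inhibition(with_test, vehicle, activated):
    """Percent reduction of the activated response caused by the test compound."""
    window = activated - vehicle
    if window <= 0:
        raise ValueError("activated response must exceed vehicle")
    return 100.0 * (activated - with_test) / window


def is_hit(percent, threshold=50.0):
    """Apply the 50% criterion to either percent activation or percent inhibition."""
    return percent >= threshold


# Hypothetical library of 25 compounds and hypothetical reporter readings.
pools = make_pools([f"cmpd_{i:02d}" for i in range(25)])
activation = percent_of_max_activation(test=6.0, vehicle=1.0, reference=11.0)
inhibition = percent_inhibition(with_test=8.5, vehicle=1.0, activated=11.0)
```

Compounds from any pool that scores as a hit would then be re-screened individually with the same two calculations.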

Comparative pharmacology involves the use of a nonhuman
animal model to either predict the effects of a compound in
humans, or provide a contrast to the effects of a compound in
humans. Accordingly, results from assays such as described
above with the PXRs of the present invention provide important
information with respect to whether a compound of interest
modulates the receptor similarly to or differently from the
analogous human receptor.

Further, comparison of activation of human PXRs with
activation of a non-human PXR of the present invention or PXRs
from other nonhuman animals which are used as models in
preclinical studies is useful in selection of preclinical
animal models predictive of effects of a test compound on P450
3A4 in humans. In this method, in vitro activation of human
PXR in the presence of a test compound is compared with in
vitro activation of PXRs from various preclinical animal
models, including, but not limited to, the PXRs of the present
invention, in the presence of the same test compound. A PXR
from a preclinical animal model exhibiting similar in vitro
activation to the human PXR in the presence of the test
compound is indicative of the preclinical animal model being
predictive of the effects of the test compound on P450 3A4 in
humans.
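
One way such a comparison might be carried out across a panel of test compounds is sketched below in Python. The sketch ranks candidate animal models by the Pearson correlation between their log2 fold-activation profiles and the human profile. The use of correlation as the similarity measure, the species and compound labels, and all numbers are hypothetical choices made for illustration; the method described above requires only that similar in vitro activation be identified.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("profile has no variation")
    return sxy / math.sqrt(sxx * syy)


def rank_models(human_profile, model_profiles):
    """Rank animal-model PXRs by similarity of activation profile to human PXR.

    Profiles map compound name -> fold activation over vehicle (must be > 0).
    Returns (species, correlation) pairs, most human-like first.
    """
    compounds = sorted(human_profile)
    human = [math.log2(human_profile[c]) for c in compounds]
    scores = {}
    for species, profile in model_profiles.items():
        values = [math.log2(profile[c]) for c in compounds]
        scores[species] = pearson(human, values)
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical fold-activation data; names are placeholders, not measured results.
human = {"compound_A": 8.0, "compound_B": 1.0, "compound_C": 4.0}
models = {
    "species_X": {"compound_A": 4.0, "compound_B": 1.0, "compound_C": 2.0},
    "species_Y": {"compound_A": 1.0, "compound_B": 8.0, "compound_C": 2.0},
}
ranked = rank_models(human, models)
```

In this hypothetical data set, species_X tracks the human profile and species_Y is its mirror image, so species_X is ranked as the more predictive model.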

Activation of the nonhuman PXRs of the present invention
was examined in the presence of various steroids, xenobiotics,
and bile acids. Steroids and xenobiotics screened for effects
on activation of the nonhuman PXRs of the present invention
included: pregnenolone 16a-carbonitrile (PCN); rifampicin;
3 amino ethyl benzoate; TCPOBOP
(1,4-bis[2-(3,5-dichloropyridyloxy)]benzene); epoxycholesterol;
RU 486; omeprazole; primidone; ethosuximide; nifedipine;
metyrapone; reserpine; trans-nonachlor; androstanol;
clofibrate; clofibric acid; troglitazone;
6,16-dimethylpregnenolone; pregnenolone; 17a-OH pregnenolone;
progesterone; 17a-OH-progesterone; 5b-pregnane 3,20-dione;
corticosterone; cortisone; DHEA (dehydroepiandrosterone); DHT
(dihydrotestosterone); spironolactone; b-estradiol; tamoxifen;
dexamethasone; dex-t-butylacetate; hydrocortisone;
d-aldosterone; cyproterone acetate; hyperforin; phenobarbital;
carbamazepine; phenytoin; clotrimazole; SR12813; lovastatin;
mevastatin; squalestatin; and chlorpromazine. Bile acids
screened for effects on activation of the nonhuman PXRs of the
present invention included: 12-ketolithocholic acid;
3,6-diketocholanic acid; 3,7-diketocholanic acid;
3a,7a-dihydroxy-12-ketocholanic acid; 6-ketolithocholic acid;
7,12-diketolithocholic acid; 7-ketodeoxycholic acid;
7-ketolithocholic acid; chenodeoxycholic acid; cholic acid;
dehydrolithocholic acid; deoxycholic acid; 7-ketodeoxycholic
acid methyl ester; glycochenodeoxycholic acid; glycocholic
acid; glycodehydrocholic acid; glycodeoxycholic acid;
glycohyocholic acid; glycohyodeoxycholic acid;
taurodeoxycholic acid; glycolithocholic acid; hyocholic acid;
hyodeoxycholic acid; lithocholic acid; murocholic acid;
taurochenodeoxycholic acid; taurocholanic acid; taurocholic
acid; taurodehydrocholic acid; taurohyocholic acid;
taurodeoxycholic acid; taurolithocholic acid;
tauro-b-muricholic acid; ursocholanic acid; ursodeoxycholic
acid; a-muricholic acid; b-muricholic acid; 5b-cholanic
acid-7a,12a-diol-3-one; and 5b-cholanic acid-3,7,12-trione.
The co-transactivation assay described in Example 2 was used
to screen these compounds. Activation was assessed via
measurement of levels of secreted placental alkaline
phosphatase (SPAP) normalized to transfection levels of a
control gene (normalized SPAP). Results from these
experiments are depicted in Figures 1-8. Xenobiotics and
steroids were assessed at a concentration of 10 µM unless
otherwise indicated on the graph. Bile acids were assessed at
a concentration of 100 µM unless otherwise indicated on the
graph.
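
The normalization described above can be illustrated with a short Python sketch. It divides the SPAP reading by the reading for the cotransfected control gene and, as an additional step assumed here for convenience, expresses the result as fold activation over vehicle. The function names and all readings are hypothetical.

```python
def normalized_spap(spap, control):
    """SPAP reading divided by the control-gene reading from the same well."""
    if control <= 0:
        raise ValueError("control-gene reading must be positive")
    return spap / control


def fold_activation(treated, vehicle):
    """Normalized SPAP for a treated well relative to the vehicle well.

    Each argument is a (spap, control) pair of readings.
    """
    vehicle_norm = normalized_spap(*vehicle)
    if vehicle_norm == 0:
        raise ValueError("vehicle normalized SPAP must be nonzero")
    return normalized_spap(*treated) / vehicle_norm


# Hypothetical plate readings: (SPAP signal, control-gene signal).
vehicle_well = (0.2, 0.4)
treated_well = (0.9, 0.3)
treated_norm = normalized_spap(*treated_well)
fold = fold_activation(treated_well, vehicle_well)
```

Normalizing before taking the ratio keeps a well with unusually high or low transfection efficiency from being mistaken for activation or its absence.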

As can be seen from these experiments, PXRs from
different species exhibited varying activation patterns in the
presence of the same compounds. Similar activation screening
assays to these can be performed with other test compounds,
most preferably new drugs in development. Results from these
assays are useful in PXR comparative pharmacology and
selecting appropriate animal models for preclinical studies
predictive of effects in humans.

The following nonlimiting examples further illustrate the
present invention.

EXAMPLES 

Example 1: Characterization of the sequence 

Four PXR LBD sequences from pig, dog, zebrafish, and
rhesus monkey were cloned. The isolation of each sequence was
achieved using essentially the same strategy. A
small stretch of the LBD was obtained either by using
cross-hybridizing PCR primers from another species or by finding
some portion of the LBD sequence in the EST database. The
remainder of the LBD was subsequently isolated by PCR
amplification of flanking sequence using a primer from within
the starting sequence combined with either A) a degenerate
oligo representing the canonical P-box of the DBD to isolate
5' sequence, or B) oligo d(T)20-G, d(T)20-C, or d(T)20-A to
isolate 3' sequence. After deriving the sequence to the poly
(A) tail, the full-length LBD was produced using primers
flanking the coding sequence. Wild-type sequence was
determined through examination of at least three independent
amplifications of each full-length LBD.
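
The comparison of independent amplifications can be pictured as a column-by-column majority vote, as in the Python sketch below. The majority-vote rule, the requirement that the reads be pre-aligned and of equal length, and the example sequences are assumptions for illustration; the text above states only that at least three independent amplifications were examined.

```python
from collections import Counter


def consensus(sequences, min_reads=3):
    """Majority-vote consensus of pre-aligned, equal-length sequence reads.

    A position is reported as 'N' when no base is present in more than half
    of the reads.
    """
    if len(sequences) < min_reads:
        raise ValueError(f"need at least {min_reads} independent reads")
    length = len(sequences[0])
    if any(len(s) != length for s in sequences):
        raise ValueError("reads must be aligned to equal length")
    result = []
    for column in zip(*sequences):
        base, count = Counter(column).most_common(1)[0]
        result.append(base if count * 2 > len(sequences) else "N")
    return "".join(result)


# Hypothetical reads from three amplifications; the third carries one PCR error.
reads = ["ATGGCA", "ATGGCA", "ATGACA"]
wild_type = consensus(reads)
```

A base change seen in only one of three amplifications is outvoted, which is the usual reason for sequencing several independent PCR products.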

To clone pig PXR LBD, total mRNA was prepared from frozen
pig liver (1 g) using the FastTrack 2.0 RNA Preparation kit
(Invitrogen, San Diego, CA). Oligo d(T)-primed cDNA synthesis
was carried out by RT-PCR using a cDNA Cycle Kit (Invitrogen,
San Diego, CA). An approximately 250 base pair stretch of pig
PXR LBD was amplified from this cDNA using homologous mouse
PXR LBD primers.

To clone dog PXR LBD, human PXR LBD primers were used to
amplify an approximately 450 base pair fragment from a dog
liver 5'-stretch λgt11 cDNA library (Clontech, Palo Alto, CA).

To clone rhesus PXR LBD, human primers were used to 
amplify all but the termini of the rhesus PXR LBD from a 
rhesus liver cDNA library. Primer sequences 5'-TGC CGT GTA 
TGT GGG GAC AAG GC-3' (SEQ ID NO: 9) and 5'-GGC ATG AAG AA GAG 
ATG ATC ATG-3' (SEQ ID NO: 10) were used to amplify a 274 base 
pair fragment from a rhesus liver library constructed in the
CMVSport6 vector. The fragment was sequenced and new primers
were designed based on this rhesus sequence. These primers
were then used with vector arm primers to amplify the regions 
5' and 3' of the known sequenced fragment. 

To clone zebrafish (Danio rerio) PXR LBD, an initial
fragment of the PXR LBD was identified as an EST sequence
(Accession # AI943313). This sequence was used to design
primers for amplification of the entire LBD from oligo
d(T)-primed cDNA synthesized from zebrafish embryos (48 h).

Example 2: Characterization of Selected PXRs via
Cotransfection Assays

In order to assess ligands for their ability to activate
PXR in a cell-based assay, a transient transfection approach
was utilized. PXR ligand binding domains were fused to the
Gal4 DNA binding domain and tested against a reporter gene
regulated by a Gal4 response element (from the yeast UASG).
Lipofectamine-based transient transfection assays were
conducted as described previously (Jones et al., 2000
Molecular Endocrinology, volume 14, pp. 27-39), except that a
UAS-tk-SPAP reporter vector was used instead of the (CYP3A1
DR3)2-tk-CAT reporter vector. When testing bile acids, an
expression plasmid encoding intestinal bile acid transporter
(IBAT) was added to facilitate cellular uptake of bile acids.

What is claimed is:

1. An isolated pregnane X nuclear receptor polypeptide 
comprising: 

(a) an amino acid sequence of SEQ ID NO: 2, 4, 6 or 8; 

(b) a variant of the amino acid sequence as defined in 
(a) which modulates P450 3A4 levels or activity; or 

(c) a fragment of (a) or (b) which modulates P450 3A4 
levels or activity. 

2. A polypeptide according to claim 1 wherein the 
variant (b) has at least 80% identity to the amino acid 
sequence of SEQ ID NO: 2, 4, 6 or 8.

3. A polynucleotide encoding a polypeptide according to 
claim 1. 

4. A polynucleotide according to claim 3 which is a 
cDNA sequence. 

5. A polynucleotide encoding a pregnane X receptor 
polypeptide which modulates P450 3A4 levels or activity, said 
polynucleotide comprising: 

(a) a nucleic acid sequence of SEQ ID NO: 1, 3, 5 or 7; 

(b) a nucleic acid sequence which hybridizes under 
stringent conditions to the nucleic acid sequence as defined 
in (a); 

(c) a nucleic acid sequence that is degenerate as a
result of the genetic code to the nucleic acid sequence as
defined in (a) or (b); or

(d) a nucleic acid sequence having at least 60% identity
to the nucleic acid sequence as defined in (a), (b) or (c).

6. The polynucleotide of claim 3 wherein the
polynucleotide encodes amino acids 106 to 434 set forth in SEQ
ID NO: 4.

7. The polynucleotide of claim 3 wherein the
polynucleotide encodes amino acids 41 to 105 set forth in SEQ
ID NO: 4.

8. The polynucleotide of claim 3 wherein the
polynucleotide encodes amino acids 1 to 40 set forth in SEQ ID
NO: 4.

9. A fusion protein comprising: 

(a) a DNA binding or ligand binding domain of the 
pregnane X receptor of claim 1; and 

(b) a non-pregnane X receptor-derived amino acid
sequence.

10. An isolated polynucleotide encoding the fusion 
protein of claim 9. 

11. An expression vector comprising a polynucleotide
according to any one of claims 3 to 8 or 10.

12. A host cell comprising an expression vector
according to claim 11.

13. An antibody specific for a polypeptide according to 
claim 1. 

14. A method for the identification of a compound that 
modulates pregnane X receptor activity and/or expression, said 
method comprising: 

(a) contacting a test compound with a canine, porcine,
primate or Zebrafish pregnane X receptor polypeptide or
polynucleotide; and

(b) determining an effect of the test compound on the 
activity and/or expression of said polypeptide or 
polynucleotide. 

15. A method according to claim 14 wherein the 
polypeptide is expressed in a cell. 

16. A substance which modulates pregnane X receptor
activity and which is identifiable by a method according to
claim 14 or 15.

17. A non-human transgenic animal expressing a PXR 
polypeptide of claim 1 or a mutant thereof. 

18. A method for selecting a preclinical animal model
which is predictive of effects of a test compound on P450 3A4
in humans comprising comparing in vitro activation of human
PXR in the presence of the test compound with in vitro
activation of PXRs from preclinical animal models in the
presence of the test compound, wherein a PXR from a
preclinical animal model exhibiting similar in vitro
activation to the human PXR is indicative of the preclinical
animal model being predictive of the effects of the test
compound on P450 3A4 in humans.

19. The method of claim 18 wherein the pregnane X 
receptor from the preclinical model is a canine, porcine, 
primate or Zebrafish pregnane X receptor. 
[Figures: four bar charts of normalized SPAP activity. Each chart lists the
same treatments along its axis: Vehicle; Pregnenolone 16a-carbonitrile;
Rifampicin; 3-amino ethyl benzoate 0.1 mM; TCPOBOP; Epoxycholesterol; RU 486;
Omeprazole; Primidone; Ethosuximide; Nifedipine; Metyrapone; Reserpine;
trans-Nonachlor; Androstanol; Clofibrate; Clofibric Acid 1 mM; Troglitazone;
6,16-Dimethylpregnenolone; Pregnenolone; 17a-OH Pregnenolone; Progesterone;
17a-OH progesterone; 5b-pregnane-3,20-dione; Corticosterone; Cortisone; DHEA;
DHT; Spironolactone; b-Estradiol; Tamoxifen; Dexamethasone;
Dex-t-butylacetate; Hydrocortisone; d-Aldosterone; Cyproterone acetate;
Hyperforin; Phenobarbital 1 mM; Phenobarbital 0.1 mM; Carbamazepine;
Phenytoin; Clotrimazole; SR12813; Lovastatin; Mevastatin; Squalestatin;
Chlorpromazine.]
[Figures: four bar charts of normalized SPAP activity for a bile acid panel.
Each chart lists the following treatments along its axis: Vehicle;
12-Ketolithocholic acid; 3,6-Diketocholanic acid; 3,7-Diketocholanic acid;
3a,7a-Dihydroxy-12-ketocholanic acid; 6-Ketolithocholic acid;
7,12-Diketolithocholic acid; 7-Ketodeoxycholic acid; 7-Ketolithocholic acid;
Chenodeoxycholic acid; Cholic acid; Dehydrolithocholic acid; Deoxycholic acid;
7-Ketodeoxycholic acid methyl ester; Glycochenodeoxycholic acid;
Glycocholic acid; Glycodehydrocholic acid; Glycodeoxycholic acid;
Glycohyocholic acid; Glycohyodeoxycholic acid; Taurodeoxycholic acid;
Glycolithocholic acid; Hyocholic acid; Hyodeoxycholic acid; Lithocholic acid;
Murocholic acid; Taurochenodeoxycholic acid; Taurocholanic acid;
Taurocholic acid; Taurodehydrocholic acid; Taurohyocholic acid;
Taurodeoxycholic acid; Taurolithocholic acid; Tauroursodeoxycholic acid;
Tauro-b-muricholic acid; Ursocholanic acid; Ursodeoxycholic acid;
a-Muricholic acid; b-Muricholic acid; 5b-Cholanic acid-7a,12a-diol-3-one;
5b-Cholanic acid-3,7,12-trione. The first three charts additionally list the
reference compounds Rifampicin (10 uM), PCN (10 uM), SR12813 (10 uM) and
3-amino ethyl benzoate (100 uM).]
Nucleotide and Amino Acid Sequence of Canine PXR Ligand 
Binding Domain (SEQ ID NO: 1) 

GGCATGAAGAAGGAGATGATCATGTCCGACGCGGCTGTGGAGCAGAGGCGGGCTCTGATCCGGAGGAAAAAG 

GMKKEMIMSDAAVEQRRALIRRKK 
CGAGAACGGATGGGCGCGTCGCCGCTGGGAGCCAAGGGGCTGAGTGAGGAGCAGCAGACGATGATCCGAGAG 

RERMGASPLGAKGLSEEQQTMIRE 
CTGATGGATGCCCAGATGAAAACCTTTGACACCACCTTCTCCAACTTCAAGGATTTCCGGCTGCCGGCCGCG 

LMDAQMKTFDTTFSNFKDFRLPAA 
TGCAGCAGCGGGCGCGAGGTCCCAGGAGCGGCGCACACTCCAGTGGGGGAGGAAGCTGCCAAGTGGAGCCAG 

CSSGREVPGAAHTPVGEEAAKWSQ 
GTCAGGGAGGATCTGTGCTCGCTGAAGGTGTGCCTGCGGCTGCGCGGGGAGGACGGCAGCGTCCAGAACTAC 

VREDLCSLKVCLRLRGEDGSVQNY 

ACACCCCAGGCCGACCGCAGCGGCGCCGAGATCTTTTCCCTGCTGCCCCACATGGCTGACATGTCCACCTAC 
TPQADRSGAEIFSLLPHMADMSTY 

ATGTTCAAAGGCGTCATCAACTTTGCCAAAGTCATCTCCCACTTCAGGGAATTGCCCATCGAGGACCAGATC 

MFKGVINFAKVISHFRELPIEDQI 
TCGCTGCTAAAGGGGGCCACCTTCGAGGTGTGCCAGCTGAGGTTCAACACGGTGTTCAACGCAGAGACCGGA 

SLLKGATFEVCQLRFNTVFNAETG 
ACCTGGGAGTGTGGCCGGCTGTCCTACTGCCTGGAAGACCCTGCAGGCGGCTTCCAGCAGCTTCTCCTGGAG 

TWECGRLSYCLEDPAGGFQQLLLE 
CCAGTGCTGAAGTTCCACTACAGGCTGAAGAGGCTGCAGCTGCATAAGGAGGAGTATGTGCTGATGCAGGCC 

PVLKFHYRLKRLQLHKEEYVLMQA 
ATCTCTCTTTTCTCCCCAGACCGCCCAGGTGTGGTGCAGCGCAGCGTGGTGGACCAGCTGCAGGAGAGATTT 

ISLFSPDRPGVVQRSVVDQLQERF 
GCCATCGCCCTGAAGGCCTACATCGAGTGCAATCGGCCGCAGCCTGCCCACCGGTTCCTGTTCCTGAAGATC 

AIALKAYIECNRPQPAHRFLFLKI 

ATGGCCATGCTCACCGAGCTTCGCAGCATCAATGCCCAGCACACCCAGAAGCTGCTGCGCATCCAGGACATA 
MAMLTELRSINAQHTQKLLRIQDI 

CACCCCTTCGCCAGCCCCCTCATGCAGGAGCTGTTCAGCATCACGGACGGCTGA 
HPFASPLMQELFSITDG* 



Amino Acid Sequence of Canine PXR Ligand Binding Domain (SEQ 
ID NO: 2) 

GMKKEMIMSD AAVEQRRALI RRKKRERMGA SPLGAKGLSE EQQTMIRELM 
DAQMKTFDTT FSNFKDFRLP AACSSGREVP GAAHTPVGEE AAKWSQVRED 
LCSLKVCLRL RGEDGSVQNY TPQADRSGAE IFSLLPHMAD MSTYMFKGVI 
NFAKVISHFR ELPIEDQISL LKGATFEVCQ LRFNTVFNAE TGTWECGRLS 
YCLEDPAGGF QQLLLEPVLK FHYRLKRLQL HKEEYVLMQA ISLFSPDRPG 
VVQRSVVDQL QERFAIALKA YIECNRPQPA HRFLFLKIMA MLTELRSINA 
QHTQKLLRIQ DIHPFASPLM QELFSITDG 



Nucleotide and Amino Acid Sequence of Full-Length Primate 
PXR (SEQ ID NO: 3) 



TCCTTGGTAAAGCTACTCCTTGATCGATCCTTTGCACCTGATTGTTCAAAGTGGACCCCAGGGG 
GAAGTCAGAGCGAAGAACTTACCGCTAAGCAGTCCAAGAGGCCCAGAAGCAAAC 

CTGGAGGTGAGACCCAAAGAAGGCTGGAACCATGCTGACTTTGTATACTGTGAGGACACAGAGTTTGCTCCT 

LEVRPKEGWNHADFVYCEDTEFAP 
GGAAAGCCCACTGTCAACGCAGATGAGGAAGTTGGGGGTCCCCAAATCTGCCGTGTATGTGGGGACAAGGCC 

GKPTVNADEEVGGPQICRVCGDKA 
ACTGGTTATCACTTCAATGTCATGACATGTGAAGGATGCAAGGGCTTTTTCAGGAGGGCCATGAAACGCAAC 

TGYHFNVMTCEGCKGFFRRAMKRN 
GCCCGCCTTAGGTGCCCCTTCCGGAAGGGCGCCTGCGAGATCACCCGGAAGACCCGGCGACAGTGCCAGGCC 

ARLRCPFRKGACEITRKTRRQCQA 
TGCCGGCTGCGCAAGTGCCTGGAGAGCGGCATGAAGAAGGAGATGATCATGTCCGACGCGGCCGTAGAGGAG 

CRLRKCLESGMKKEMIMSDAAVEE 
AGGCGGGCCTTGATCAAGAGGAAGAAAAGAGAACGGATCGGGACTCAGCCACCCGGAGTGCAGGGGCTGACG 

RRALIKRKKRERIGTQPPGVQGLT 
GAGGAGCAGCGGATGATGATCAGGGAGCTGATGGACGCTCAGATGAAAACCTTTGACACTACCTTCTCCCAT 

EEQRMMIRELMDAQMKTFDTTFSH 
TTCAAGAATTTCCGGCTGCCAGGGGTGCTTAGCAGTGGCTGTGAGATGCCAGAGTCTCTGCAGGCCCCATCG 

FKNFRLPGVLSSGCEMPESLQAPS 
AGGGAAGAAGCTGCCAAGTGGAACCAGGTCAGGAAAGATCTGTGGTCTGTGAAGGTCTCCGTGCAGCTGCGG 

REEAAKWNQVRKDLWSVKVSVQLR 
GGGGAGGATGGCAGTGTCTGGAACTACAAACCCCCAGCCGACAATGGCGGGAAAGAGATCTTCTCCTTGCTG 

GEDGSVWNYKPPADNGGKEIFSLL 
CCCCACATGGCTGACATGTCAACCTACATGTTCAAAGGCATCATCAACTTTGCCAAAGTCATCTCCTACTTC 

PHMADMSTYMFKGIINFAKVISYF 
AGGGACCTGCCCATCGAGGACCAGATCTCCCTACTGAAGGGGGCCACTTTTGAGCTGTGCCAGCTGAGATTC 

RDLPIEDQISLLKGATFELCQLRF 
AACACAGTATTCAACGCGGAGACTGGAACTTGGGAGTGTGGCCGGCTGTCCTACTGCTTGGAAGACCCTGCA 

NTVFNAETGTWECGRLSYCLEDPA 
GGTGGTTTCCAGCAACTTCTGCTGGAGCCCATGCTGAAATTCCACTACATGCTGAAGAAGCTGCAGCTACAC 

GGFQQLLLEPMLKFHYMLKKLQLH 
GAGGAGGAGTATGTGCTGATGCAGGCCATCTCCCTCTTCTCCCCAGACCGCCCAGGTGTGGTGCAGCACCGC 

EEEYVLMQAISLFSPDRPGVVQHR 
GTGGTGGACCAGCTGCAGGAGCAATACGCTATTACTCTGAAGTCCTACATTGAATGCAATCGGCCCCAGCCT 

VVDQLQEQYAITLKSYIECNRPQP 
GCTCATAGGTTCCTGTTCCTGAAGATCATGGCTATGCTCACCGAGCTCCGCAGCATCAACGCCCAGCACACC 

AHRFLFLKIMAMLTELRSINAQHT 
CAGCGGCTGCTGCGCATCCAGGACATACACCCCTTTGCTACGCCCCTCATGCAGGAGTTGTTCGGCATCACG 

QRLLRIQDIHPFATPLMQELFGIT 
GGTAGCTGAGTGGCTGCCCTTGGGTGA 

GS* 
Amino Acid Sequence of Full-Length Primate PXR (SEQ ID NO: 4) 

LEVRPKEGWNHADFVYCEDTEFAPGKPTVNADEEVGGPQICRVCGDKA 
TGYHFNVMTCEGCKGFFRRAMKRNARLRCPFRKGACEITRKTRRQCQA 
CRLRKCLESGMKKEMIMSDAAVEERRALIKRKKRERIGTQPPGVQGLT 
EEQRMMIRELMDAQMKTFDTTFSHFKNFRLPGVLSSGCEMPESLQAPS 
REEAAKWNQVRKDLWSVKVSVQLRGEDGSVWNYKPPADNGGKEIFSLL 
PHMADMSTYMFKGIINFAKVISYFRDLPIEDQISLLKGATFELCQLRF 
NTVFNAETGTWECGRLSYCLEDPAGGFQQLLLEPMLKFHYMLKKLQLH 
EEEYVLMQAISLFSPDRPGVVQHRVVDQLQEQYAITLKSYIECNRPQP 
AHRFLFLKIMAMLTELRSINAQHTQRLLRIQDIHPFATPLMQELFGIT 
GS 



Nucleotide and Amino Acid Sequence of Porcine PXR Ligand 
Binding Domain (SEQ ID NO: 5) 

GGCATGAGGAAGGAAATGATCATGTCAGATGCAGCTGTGGAGCAGAGGCGGGCCTTGATCAGGAGGAAGAAA 

GMRKEMIMSDAAVEQRRALIRRKK 
CGAGAACAGATCGGGGCTCAGCCCCCAGGAGCCAAGGGTCTCACTGAAGAGCAGCGGACAATGATCAGTGAG 

REQIGAQPPGAKGLTEEQRTMISE 
CTGATGAACGCTCAGATGAAAACCTTTGACACCACCTTCACACATTTCAAGAATTTTCGGTTACCAGAGGTG 

LMNAQMKTFDTTFTHFKNFRLPEV 
CTTAGCAGTAGCCTCGAGATTCCAGAGTGTCTGCAGACTCCGTCGTCAAGGGAAGAAGCTGCCAAGTGGAGC 

LSSSLEIPECLQTPSSREEAAKWS 

AAGCTCAGGGAAGATCTGTGCTCAGTGAAACTCTCTCTGCAGCTAAGGGGGGAAGATGGTAGCGTCTGGAAC 

KLREDLCSVKLSLQLRGEDGSVWN 
TACAAACCCCCAGCAGACAACAGTGGGAAAGAGATCTTTTCCCTGCTGCCCCACATAGCTGACATGTCAACC 

YKPPADNSGKEIFSLLPHIADMST 
TACATGTTCAAAGGCATTATCAACTTTGCCAAAGTCATCTCCTACTTCAGGGACTTGCCCATTGAGGACCAG 

YMFKGIINFAKVISYFRDLPIEDQ 
ATCTCTCTGCTGAAGGGGGCCACCTTTGAGCTGTGCCAGCTGAGATTCAACACGGTGTTCAACGCAGAGACG 

ISLLKGATFELCQLRFNTVFNAET 
GGGACCTGGGAGTGTGGTCGGCTGTCCTACAGCTTGGAAGACCCCTCAGGTGGCTTCCAGCAGCTTCTCCTG 

GTWECGRLSYSLEDPSGGFQQLLL 
CAGCCCATGCTGAAATTCCACTACATGCTGAAGAAGCTGCAGCTGCATAAGGAGGAGTATGTGCTGATGCAG 

QPMLKFHYMLKKLQLHKEEYVLMQ 
GCCATCTCCCTTTTCTCTCCAGACCGCCCGGGTGTGGTGCAACGCCAAGTGGTGGACCAGCTGCAGGAGAGG 

AISLFSPDRPGVVQRQVVDQLQER 
TTTGCCATTACCCTGAAGGCCTACATCGAGTGCAACCGGCCCCAGCCTGCCCACCGATTCCTGTTCCTGAAG 

FAITLKAYIECNRPQPAHRFLFLK 
ATCATGGCTATGCTCACTGAGCTCCGCAGCATCAACGCCCAACACACCCAGCGGCTGCTGCGAATCCAGGAC 

IMAMLTELRSINAQHTQRLLRIQD 
ATACACCCCTTCGCCACCCCACTCATGCAGGAGTTATTCAGCATCACAGAAAGCTGA 

IHPFATPLMQELFSITES* 



Amino Acid Sequence of Porcine PXR Ligand Binding Domain 
(SEQ ID NO: 6) 

GGCATGAGGAAGGAAATGATCATGTCAGATGCAGCTGTGGAGCAGAGGCGGGCCTTGATCAGGAGGAAGAAA 

GMRKEMIMSDAAVEQRRALIRRKK 
CGAGAACAGATCGGGGCTCAGCCCCCAGGAGCCAAGGGTCTCACTGAAGAGCAGCGGACAATGATCAGTGAG 

REQIGAQPPGAKGLTEEQRTMISE 
CTGATGAACGCTCAGATGAAAACCTTTGACACCACCTTCACACATTTCAAGAATTTTCGGTTACCAGAGGTG 

LMNAQMKTFDTTFTHFKNFRLPEV 
CTTAGCAGTAGCCTCGAGATTCCAGAGTGTCTGCAGACTCCGTCGTCAAGGGAAGAAGCTGCCAAGTGGAGC 

LSSSLEIPECLQTPSSREEAAKWS 

AAGCTCAGGGAAGATCTGTGCTCAGTGAAACTCTCTCTGCAGCTAAGGGGGGAAGATGGTAGCGTCTGGAAC 

KLREDLCSVKLSLQLRGEDGSVWN 
TACAAACCCCCAGCAGACAACAGTGGGAAAGAGATCTTTTCCCTGCTGCCCCACATAGCTGACATGTCAACC 

YKPPADNSGKEIFSLLPHIADMST 
TACATGTTCAAAGGCATTATCAACTTTGCCAAAGTCATCTCCTACTTCAGGGACTTGCCCATTGAGGACCAG 

YMFKGIINFAKVISYFRDLPIEDQ 
ATCTCTCTGCTGAAGGGGGCCACCTTTGAGCTGTGCCAGCTGAGATTCAACACGGTGTTCAACGCAGAGACG 

ISLLKGATFELCQLRFNTVFNAET 
GGGACCTGGGAGTGTGGTCGGCTGTCCTACAGCTTGGAAGACCCCTCAGGTGGCTTCCAGCAGCTTCTCCTG 

GTWECGRLSYSLEDPSGGFQQLLL 
CAGCCCATGCTGAAATTCCACTACATGCTGAAGAAGCTGCAGCTGCATAAGGAGGAGTATGTGCTGATGCAG 

QPMLKFHYMLKKLQLHKEEYVLMQ 
GCCATCTCCCTTTTCTCTCCAGACCGCCCGGGTGTGGTGCAACGCCAAGTGGTGGACCAGCTGCAGGAGAGG 

AISLFSPDRPGVVQRQVVDQLQER 
TTTGCCATTACCCTGAAGGCCTACATCGAGTGCAACCGGCCCCAGCCTGCCCACCGATTCCTGTTCCTGAAG 

FAITLKAYIECNRPQPAHRFLFLK 
ATCATGGCTATGCTCACTGAGCTCCGCAGCATCAACGCCCAACACACCCAGCGGCTGCTGCGAATCCAGGAC 

IMAMLTELRSINAQHTQRLLRIQD 
ATACACCCCTTCGCCACCCCACTCATGCAGGAGTTATTCAGCATCACAGAAAGCTGA 

IHPFATPLMQELFSITES* 



WO 02/094865    PCT/US02/16445 

Nucleotide and Amino Acid Sequence of Zebrafish PXR Ligand 
Binding Domain (SEQ ID NO: 7) 

GGCATGAAGAGAGAGCTGATCATGTCGGATGAGGCGGTGGAGAAGCGGAGGTTGCAGATCAGGAGGAAGAGG 

GMKRELIMSDEAVEKRRLQIRRKR 
ATGCAGGAAGAGCCTGTAACTCTCACTCCTCAACAGGAAGCTGTCATACAAGAGCTGCTYAACGCACACAAG 

MQEEPVTLTPQQEAVIQELLNAHK 
AAAACCTTCGACATGACTTGTGCCCATTTCAGTCAGTTCCGGCCTTTAGATCGGGATCAGAAGTCTGTGTCC 

KTFDMTCAHFSQFRPLDRDQKSVS 
GAGTCGAGTCCACTCACAAACGGCAGCTGGATCGATCACAGACCCATCGCTGAAGACCCAATGCAGTGGGTC 

ESSPLTNGSWIDHRPIAEDPMQWV 
TTCAATCCCACTTCGCTCTCGTCCTCTTCCTCCAGCTACCAGAGCCTTGACAATAAAGAGAAGAAGCACTTT 

FNPTSLSSSSSSYQSLDNKEKKHF 
AAAAGTGGCAACTTCTCCTCTCTGCCACACTTCACAGACCTCACCACGTACATGATCAAGAATGTCATCAAC 

KSGNFSSLPHFTDLTTYMIKNVIN 
TTCGGGAAGACGCTGACAATGTTTAGGGCTCTGGTTATGGAGGACCAGATCTCGCTGCTGAAAGGTGCCACC 

FGKTLTMFRALVMEDQISLLKGAT 
TTTGAGATCATTCTGATTCACTTCAACATGTTCTTTAATGAAGTGACGGGAATTTGGGAGTGCGGCCCCCTG 

FEIILIHFNMFFNEVTGIWECGPL 
CAGTACTGCATGGATGATGCCTTTCGAGCTGGTTTTCAGCACCATCTGCTGGACCCAATGATGAATTTCCAT 

QYCMDDAFRAGFQHHLLDPMMNFH 
TACACACTGCGTAAGCTGCGTTTGCATGAGGAGGAGTATGTGCTGATGCAGGCCCTCTCTCTCTTTTCACCA 

YTLRKLRLHEEEYVLMQALSLFSP 
GATCGCCCTGGTGTGACAGACCACAAAGTGATCGACCGCAACCAGGAAACACTAGCGCTTACCCTAAAGACT 

DRPGVTDHKVIDRNQETLALTLKT 
TACATTGAGGCCAAGAGAAATGGGCCAGAAAAACATCTGCTGTTCCCAAAGATTATGGGGTGCCTGACCGAG 

YIEAKRNGPEKHLLFPKIMGCLTE 
ATGAGGAGCATGAACGAAGAGTACACCAAACAAGTGCTGAAAATCCAGGACATGCAGCCTGAAGTGTCTCCA 

MRSMNEEYTKQVLKIQDMQPEVSP 
CTTTGGTTGGAAATAATAAGCAAAGACACCTAAGTCCTCACAAAGCACGTGCAACTCTACTTATTCCCAACT 

LWLEIISKDT* 



Amino Acid Sequence of Zebrafish PXR Ligand Binding Domain 
(SEQ ID NO: 8) 

GMKRELIMSDEAVEKRRLQIRRKRMQEEPVTLTPQQEAVIQELLNAHK 
KTFDMTCAHFSQFRPLDRDQKSVSESSPLTNGSWIDHRPIAEDPMQWV 
FNPTSLSSSSSSYQSLDNKEKKHFKSGNFSSLPHFTDLTTYMIKNVIN 
FGKTLTMFRALVMEDQISLLKGATFEIILIHFNMFFNEVTGIWECGPL 
QYCMDDAFRAGFQHHLLDPMMNFHYTLRKLRLHEEEYVLMQALSLFSP 
DRPGVTDHKVIDRNQETLALTLKTYIEAKRNGPEKHLLFPKIMGCLTE 
MRSMNEEYTKQVLKIQDMQPEVSPLWLEIISKDT* 
Description 

CRYSTALLIZED HUMAN XENOBIOTIC NUCLEAR RECEPTOR PXR/SXR 
LIGAND BINDING DOMAIN POLYPEPTIDE AND 
SCREENING METHODS EMPLOYING SAME 

Technical Field 

The present invention relates generally to the structure of the ligand 
binding domain of PXR, and more particularly to the crystalline structure of the 
ligand binding domain of PXR. The invention further relates to methods by 
which modulators and ligands of PXR can be identified. 





Abbreviations 

ATP         adenosine triphosphate 
ADP         adenosine diphosphate 
APS         Advanced Photon Source 
BSA         bovine serum albumin 
CBP         CREB-binding protein 
cDNA        complementary DNA 
CI          chemical ionization 
CPS         counts per second 
CYP         cytochrome P450 
DBD         DNA binding domain 
DMSO        dimethyl sulfoxide 
DNA         deoxyribonucleic acid 
DTT         dithiothreitol 
EDTA        ethylenediaminetetraacetic acid 
EI          electron impact ionization 
ER          estrogen receptor 
FRET        fluorescent resonance energy transfer 
GC          gas chromatography 
GC/MS       gas chromatography/mass spectrometry 
HEPES       N-2-Hydroxyethylpiperazine-N-2-ethanesulfonic acid 
hPXR        human pregnane X receptor 
HRE         hormone response element 
kDa         kilodalton(s) 
LBD         ligand binding domain 
MAD         multiwavelength anomalous diffraction 
MODY        mature onset diabetes of the young 
MS          mass spectrometry 
NDP         nucleotide diphosphate 
nt          nucleotide 
NTP         nucleotide triphosphate 
PAGE        polyacrylamide gel electrophoresis 
PAR         pregnane-activated receptor 
PCN         pregnenolone 16a-carbonitrile 
PCR         polymerase chain reaction 
pI          isoelectric point 
PPAR        peroxisome proliferator-activated receptor 
PR          progesterone receptor 
PXR         pregnane X receptor 
RAR         retinoic acid receptor 
RIF         rifampicin 
RMSD        root-mean-square deviation 
RXR         retinoid X receptor 
SDS         sodium dodecyl sulfate 
SDS-PAGE    sodium dodecyl sulfate polyacrylamide gel electrophoresis 
SIRAS       single isomorphous replacement anomalous scattering 
SR          SR12813 
SXR         steroid and xenobiotic receptor 
TIC         total ion chromatogram 
TR          thyroid hormone receptor 
TTR         plasma transthyretin 
VDR         vitamin D receptor 
vHNF        variant hepatocyte nuclear factor 
WT          wildtype 






Amino Acid Abbreviations 

Single-Letter Code    Three-Letter Code    Name 
A                     Ala                  Alanine 
V                     Val                  Valine 
L                     Leu                  Leucine 
I                     Ile                  Isoleucine 
P                     Pro                  Proline 
F                     Phe                  Phenylalanine 
W                     Trp                  Tryptophan 
M                     Met                  Methionine 
G                     Gly                  Glycine 
S                     Ser                  Serine 
T                     Thr                  Threonine 
C                     Cys                  Cysteine 
Y                     Tyr                  Tyrosine 
N                     Asn                  Asparagine 
Q                     Gln                  Glutamine 
D                     Asp                  Aspartic Acid 
E                     Glu                  Glutamic Acid 
K                     Lys                  Lysine 
R                     Arg                  Arginine 
H                     His                  Histidine 



Functionally Equivalent Codons 

Amino Acid                    Codons 
Alanine          Ala    A     GCA GCC GCG GCU 
Cysteine         Cys    C     UGC UGU 
Aspartic Acid    Asp    D     GAC GAU 
Glutamic Acid    Glu    E     GAA GAG 
Phenylalanine    Phe    F     UUC UUU 
Glycine          Gly    G     GGA GGC GGG GGU 
Histidine        His    H     CAC CAU 
Isoleucine       Ile    I     AUA AUC AUU 
Lysine           Lys    K     AAA AAG 
Methionine       Met    M     AUG 
Asparagine       Asn    N     AAC AAU 
Proline          Pro    P     CCA CCC CCG CCU 
Glutamine        Gln    Q     CAA CAG 
Threonine        Thr    T     ACA ACC ACG ACU 
Valine           Val    V     GUA GUC GUG GUU 
Tryptophan       Trp    W     UGG 
Tyrosine         Tyr    Y     UAC UAU 
Leucine          Leu    L     UUA UUG CUA CUC CUG CUU 
Arginine         Arg    R     AGA AGG CGA CGC CGG CGU 
Serine           Ser    S     AGC AGU UCA UCC UCG UCU 

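The codon equivalences tabulated above can be checked mechanically. The following is a minimal Python sketch, offered only as an illustration and not as part of the disclosure. It builds the standard genetic code, translates a coding sequence, and lists the synonymous RNA codons for a given amino acid. The names CODON_TABLE, translate and synonymous_codons, as well as the short example sequence, are assumptions introduced here.

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G base order.
# '*' marks a stop codon. All names below are illustrative.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence):
    """Translate a DNA or RNA coding sequence, read in frame from position 0."""
    dna = sequence.upper().replace("U", "T")
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def synonymous_codons(one_letter_code):
    """Return the sorted RNA codons that encode the given amino acid."""
    return sorted(
        codon.replace("T", "U")
        for codon, aa in CODON_TABLE.items()
        if aa == one_letter_code
    )


if __name__ == "__main__":
    print(translate("GGCATGAAGAAGGAGATG"))  # short illustrative sequence
    print(synonymous_codons("S"))
```

Two sequences that differ only by substitution of functionally equivalent codons translate to the same polypeptide under this sketch, which is the property the table is meant to convey.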



Background Art 

Nuclear receptors represent a superfamily of proteins that specifically 
bind a physiologically relevant small molecule, such as a hormone or vitamin. 
As a result of a molecule binding to a nuclear receptor, the nuclear receptor 
changes the ability of a cell to transcribe DNA, i.e., nuclear receptors modulate 
the transcription of DNA. However, they can also have 
transcription-independent actions. 

Unlike integral membrane receptors and membrane-associated 
receptors, nuclear receptors reside in either the cytoplasm or nucleus of 
eukaryotic cells. Thus, nuclear receptors comprise a class of intracellular, 



WO 02/095652    PCT/US02/15701 

soluble ligand-regulated transcription factors. Nuclear receptors include but 
are not limited to receptors for glucocorticoids, androgens, mineralocorticoids, 
progestins, estrogens, thyroid hormones, vitamin D, retinoids, and icosanoids. 
Many nuclear receptors, identified by either sequence homology to known 
receptors (See, Drewes et al., (1996) Mol. Cell. Biol. 16:925-31) or based on 
their affinity for specific DNA binding sites in gene promoters (See, Sladek et 
al., Genes Dev. 4:2353-65), have unascertained ligands and are therefore 
termed "orphan receptors". 

Structurally, nuclear receptors are generally characterized by two 
distinct structural elements. First, nuclear receptors comprise a central DNA 
binding domain that targets the receptor to specific DNA sequences, which 
are known as hormone response elements (HREs). The DNA binding 
domains of these receptors are related in structure and sequence, and are 
located within the middle of the receptor. Second, the C-terminal region of 
nuclear receptors encompasses the ligand binding domain (LBD). Upon 
binding a ligand, the receptor shifts to a transcriptionally active state. 

The cytochrome P450 (CYP) family of heme-containing proteins plays 
critical roles in the oxidative metabolism of drugs and other xenobiotics in the 
liver and small intestine. In particular, the CYP3A gene products bind and 
hydroxylate a wide variety of chemical structures, including >50% of all drugs. 
Maurel, (1996) in Cytochrome P450: Metabolic and Toxicological Aspects 
(Ioannides, ed.) CRC Press, Inc., Boca Raton, Florida, 241-70. Expression of 
CYP3A is induced at the level of transcription by a variety of xenobiotics, 
including many that are metabolized by CYP3A. 
The pregnane X receptor (PXR; NR1I2), a member of the nuclear 
receptor family of ligand-activated transcription factors, is a key regulator of 
CYP3A gene expression in mammalian liver and small intestine. Kliewer et 
al., (1998) Cell 92: 73-82; Lehmann et al., (1998) J. Clin. Invest. 102: 
1016-23; Bertilsson et al., (1998) Proc. Natl. Acad. Sci. U.S.A. 95: 12208-13; 
Blumberg et al., (1998) Genes Dev. 12: 3195-205; Xie et al., (2000) Nature 
406: 435-39. The human ortholog of PXR is alternatively known as pregnane 
activated receptor (PAR) and steroid and xenobiotic receptor (SXR). 
Bertilsson et al., (1998) Proc. Natl. Acad. Sci. U.S.A. 95: 12208-13; Blumberg 
et al., (1998) Genes Dev. 12: 3195-205. PXR is activated by most of the 
xenobiotics that are known to induce CYP3A gene expression, including 
commonly used drugs such as the antibiotic rifampicin, the glucocorticoid 
dexamethasone, and the herbal therapy St. John's wort. Kliewer et al., (1998) 
Cell 92: 73-82; Lehmann et al., (1998) J. Clin. Invest. 102: 1016-23; Bertilsson 
et al., (1998) Proc. Natl. Acad. Sci. U.S.A. 95: 12208-13; Blumberg et al., 
(1998) Genes Dev. 12: 3195-205; Jones et al., (2000) Mol. Endocrinol. 14: 
27-39; Moore et al., (2000) Proc. Natl. Acad. Sci. U.S.A. 97: 7500-502. 

Like other nuclear receptors, PXR contains both a DNA binding domain 
and a ligand binding domain (LBD). PXR binds to the xenobiotic response 
elements in the regulatory regions of CYP3A genes as a heterodimer with the 
9-cis retinoic acid receptor (RXR). Kliewer et al., (1998) Cell 92: 73-82; 
Lehmann et al., (1998) J. Clin. Invest. 102: 1016-23; Blumberg et al., (1998) 
Genes Dev. 12: 3195-205. 

PXR mediates potentially dangerous drug-drug interactions by 
upregulating CYP3A expression in response to one compound, which in turn 
can then lead to the metabolism of other drugs vital to survival. For example, 
patients taking the herbal anti-depressant St. John's wort have exhibited a 
dramatic drop in serum levels of other critical drugs, including the antiretroviral 
drug Indinavir (also known as CRIXIVAN®) and the immunosuppressant 
compound cyclosporin. Piscitelli et al., (2000) Lancet 355: 547-48; 
Ruschitzka et al., (2000) Lancet 355: 548-49. The molecular basis of this 
effect was traced to PXR. Moore et al., (2000) Proc. Natl. Acad. Sci. U.S.A. 
97: 7500-502; Wentworth et al., (2000) J. Endocrinol. 166: R11-R16. 
Hyperforin, a constituent of St. John's wort, activates PXR and upregulates 
CYP3A expression, which leads to the metabolism of these vital drugs. 
Unraveling the molecular basis of PXR activation by ligands could prove 
critical in preventing such potentially deadly cross-reactivities between drugs. 

Unlike the steroid, retinoid, and thyroid hormone receptors, which are 
highly selective for their cognate hormone, PXR has evolved to detect 
structurally diverse compounds. These include drugs and xenobiotics, as well 
as endogenous compounds, including the toxic bile acid lithocholic acid and 
certain C21 steroids (pregnanes). Kliewer et al., (1998) Cell 92: 73-82; 
Bertilsson et al., (1998) Proc. Natl. Acad. Sci. U.S.A. 95: 12208-13; Jones et 
al., (2000) Mol. Endocrinol. 14: 27-39. Although these diverse interactions 
imply a degree of promiscuity, PXR also exhibits specificity. This is illustrated 
by the fact that there are marked differences in the pharmacologic activation 
profile of PXR across species. For instance, human PXR is activated 
efficiently by rifampicin and the cholesterol-lowering drug SR12813 (Berkhout 
et al., (1996) J. Biol. Chem. 271: 14376-82; Berkhout et al., (1997) 
Atherosclerosis 133: 203-21), whereas mouse PXR is not (Jones et al., (2000) 
Mol. Endocrinol. 14: 27-39). The mouse version of this receptor is activated 
by the synthetic steroid pregnenolone 16a-carbonitrile (PCN), whereas the 
human receptor is not. These data suggest that PXR has evolved in order to 
detect the different xenobiotic challenges faced by different species. By 
binding diverse but precise arrays of compounds, PXR exhibits directed 
promiscuity. 

Unraveling the structural basis of how PXR recognizes an array of 
different endogenous and exogenous compounds, including both small and 
large ligands, is critical to understanding how harmful compounds are cleared 
from the body. Such knowledge can also improve our ability to predict and 
avoid dangerous drug-drug interactions. 

Polypeptides, including the ligand binding domain of PXR, have a 
three-dimensional structure determined by the primary amino acid sequence 
and the environment surrounding the polypeptide. This three-dimensional 
structure establishes the polypeptide's activity, stability, binding affinity, 
binding specificity, and other biochemical attributes. Thus, knowledge of a 
protein's three-dimensional structure can provide much guidance in designing 
agents that mimic, inhibit, or improve its biological activity in soluble or 
membrane bound forms. 

The three-dimensional structure of a polypeptide can be determined in 
a number of ways. Many of the most precise methods employ X-ray 
crystallography (See, e.g., Van Holde, (1971) Physical Biochemistry, 
Prentice-Hall, N. J., 221-39). This technique relies on the ability of 
crystalline lattices to diffract X-rays or other forms of radiation. 
Diffraction experiments suitable for determining the three-dimensional 
structure of macromolecules typically require high-quality crystals. 
Unfortunately, such crystals have been unavailable for the ligand binding 
domain of PXR, as well as many other proteins of interest. Thus, high-quality 
diffracting crystals of the ligand binding domain of PXR would greatly assist 
in the elucidation of PXR's three-dimensional structure, and would provide 
insight into the ligand binding properties of PXR. 

Clearly, the solved crystal structure of the PXR ligand binding domain 
would be useful in the design of modulators of activity mediated by all PXR 
isoforms. Evaluation of the available sequence data has made it clear that 
PXR shows structural homology with the three-dimensional fold of other 
proteins. 

A solved PXR-ligand crystal structure would provide structural details 
and insights necessary to design a modulator of PXR that maximizes 
preferred requirements for any modulator, i.e., potency and specificity. By 
exploiting the structural details obtained from a PXR-ligand crystal structure, it 
would be possible to design a PXR modulator that, despite PXR's similarity 
with other proteins, exploits the unique structural features of PXR. A PXR 
modulator developed using structure-assisted design would take advantage of 
heretofore unknown PXR structural considerations and thus be more effective 
than a modulator developed using homology-based design. Potential or 
existent homology models cannot provide the necessary degree of specificity. 
A PXR modulator designed using the structural coordinates of a crystalline 
form of PXR would also provide a starting point for the development of 
modulators of other structurally similar proteins. 

What is needed, therefore, is a crystallized form of a PXR LBD 
polypeptide, preferably in complex with a ligand. Acquisition of crystals of the 
PXR LBD polypeptide will permit the three-dimensional structure of the PXR 
LBD to be determined. Knowledge of this three-dimensional structure will 
facilitate the design of modulators of PXR activity. Such modulators can lead 
to therapeutic compounds to treat a wide range of conditions, including 
conditions associated with toxic xenobiotics. 

Summary of the Invention 
A substantially pure PXR ligand binding domain polypeptide in 
crystalline form is disclosed. Preferably, the crystalline form has lattice 
constants of a = 91.6 Å, b = 91.6 Å, c = 85.0 Å, α = 90°, β = 90°, γ = 90°. 
More preferably, the crystalline form is a tetragonal crystalline form. Even 
more preferably, the crystalline form has a space group of P4₃2₁2. Still more 
preferably, the PXR ligand binding domain polypeptide has the amino acid 
sequence shown in SEQ ID NO: 4. 
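
As a worked illustration of the lattice constants recited above, the unit cell volume follows from the general cell-volume formula, which reduces to the product of the three edge lengths when all three angles are 90°. The following Python sketch is illustrative only and is not part of the disclosure; the function name unit_cell_volume is an assumption introduced here.

```python
import math


def unit_cell_volume(a, b, c, alpha=90.0, beta=90.0, gamma=90.0):
    """Volume of a crystallographic unit cell.

    Edge lengths a, b, c are in angstroms; angles are in degrees.
    Uses the general (triclinic) formula, which reduces to a * b * c
    when all three angles are 90 degrees.
    """
    cos_a, cos_b, cos_g = (
        math.cos(math.radians(angle)) for angle in (alpha, beta, gamma)
    )
    factor = math.sqrt(
        1.0 - cos_a ** 2 - cos_b ** 2 - cos_g ** 2 + 2.0 * cos_a * cos_b * cos_g
    )
    return a * b * c * factor


if __name__ == "__main__":
    # Lattice constants stated in the text: a = b = 91.6, c = 85.0, angles 90.
    volume = unit_cell_volume(91.6, 91.6, 85.0)
    print(f"{volume:.1f} cubic angstroms")
```

With the stated constants the volume is 91.6 x 91.6 x 85.0, or approximately 7.13 x 10^5 cubic angstroms.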

In a preferred embodiment, the PXR ligand binding domain polypeptide 
is in complex with a ligand. More preferably, the ligand is a 
hypocholesterolemic drug. Even more preferably, the hypocholesterolemic 
drug is SR12813. 

A method for determining the three-dimensional structure of a 
crystallized PXR ligand binding domain polypeptide to a resolution of about 
3.0 Å or better is also disclosed. The method comprises (a) crystallizing a 
PXR ligand binding domain polypeptide; and (b) analyzing the PXR ligand 
binding domain polypeptide to determine the three-dimensional structure of 
the crystallized PXR ligand binding domain polypeptide, whereby the 
three-dimensional structure of a crystallized PXR ligand binding domain 
polypeptide is determined to a resolution of about 3.0 Å or better. 

A method of designing a modulator of a PXR polypeptide is also 
disclosed. The method comprises (a) designing a potential modulator of a 
PXR polypeptide that will form bonds with amino acids in a ligand binding site 
based upon a crystalline structure of a PXR ligand binding domain 
polypeptide; (b) synthesizing the modulator; and (c) determining whether the 
potential modulator modulates the activity of the PXR polypeptide, whereby a 
modulator of a PXR polypeptide is designed. 

In an alternative embodiment, a method of designing a modulator that 
selectively modulates the activity of a PXR polypeptide in accordance with the 
present invention comprises: (a) obtaining a crystalline form of a PXR ligand 
binding domain polypeptide; (b) evaluating the three-dimensional structure of 
the crystallized PXR ligand binding domain polypeptide; and (c) synthesizing 
a potential modulator based on the three-dimensional crystal structure of the 
crystallized PXR ligand binding domain polypeptide, whereby a modulator that 
selectively modulates the activity of a PXR polypeptide is designed. 
Preferably, the method further comprises contacting a PXR ligand binding 
domain polypeptide with the potential modulator; and assaying the PXR ligand 
binding domain polypeptide for binding of the potential modulator, for a 
change in activity of the PXR ligand binding domain polypeptide, or both. 
More preferably, the crystalline form is such that the three-dimensional 
structure of the crystallized PXR ligand binding domain polypeptide can be 
determined to a resolution of about 3.0 Å or better. 

In yet another embodiment, a method of designing a modulator of a
PXR polypeptide in accordance with the present invention comprises: (a)
selecting a candidate PXR ligand; (b) determining which amino acid or amino
acids of a PXR polypeptide interact with the ligand using a three-dimensional
model of a crystallized protein comprising a PXR LBD; (c) identifying in a
biological assay for PXR activity a degree to which the ligand modulates the
activity of the PXR polypeptide; (d) selecting a chemical modification of the
ligand wherein the interaction between the amino acids of the PXR
polypeptide and the ligand is predicted to be modulated by the chemical
modification; (e) performing the chemical modification on the ligand to form a
modified ligand; (f) contacting the modified ligand with the PXR polypeptide;
(g) identifying in a biological assay for PXR activity a degree to which the
modified ligand modulates the biological activity of the PXR polypeptide; and
(h) comparing the biological activity of the PXR polypeptide in the presence of
modified ligand with the biological activity of the PXR polypeptide in the
presence of the unmodified ligand, whereby a modulator of a PXR polypeptide
is designed. Preferably, the PXR polypeptide is a PXR polypeptide. More
preferably, the three-dimensional model of a crystallized protein is a PXR LBD
polypeptide with a bound ligand. Even more preferably, the method further
comprises repeating steps (a) through (f), if the biological activity of the PXR
polypeptide in the presence of the modified ligand varies from the biological
activity of the PXR polypeptide in the presence of the unmodified ligand.

A method for identifying a PXR modulator is also disclosed. The
method comprises (a) providing atomic coordinates of a PXR ligand binding
domain to a computerized modeling system; and (b) modeling a ligand that
fits spatially into a binding cavity or on the surface of the PXR ligand binding
domain to thereby identify a PXR modulator. Preferably, the method further
comprises identifying in an assay for PXR-mediated activity a modeled ligand
that increases or decreases the activity of the PXR.

A method of identifying a PXR modulator that selectively modulates the
activity of a PXR polypeptide compared to other polypeptides is disclosed.
The method comprises (a) providing atomic coordinates of a PXR ligand
binding domain to a computerized modeling system; and (b) modeling a
ligand that fits spatially into a binding cavity or on the surface of a PXR ligand
binding domain and that interacts with conformationally constrained residues
of a PXR that are conserved among PXR isoforms to thereby identify a PXR
modulator. Preferably, the method further comprises identifying in a biological
assay for PXR-mediated activity a modeled ligand that selectively binds to the
PXR ligand binding domain and increases or decreases the activity of the
PXR.

An assay method for identifying a compound that inhibits binding of a
ligand to a PXR polypeptide is disclosed. The assay method comprises: (a)
designing a test inhibitor compound capable of modulating PXR activity,
based on the atomic coordinates of a PXR ligand binding domain; (b)
synthesizing the test inhibitor compound; (c) incubating a PXR polypeptide
with a ligand in the presence of a test inhibitor compound; (d) determining an
amount of ligand that is bound to the PXR polypeptide, wherein decreased
binding of ligand to the PXR protein in the presence of the test inhibitor
compound relative to binding of ligand in the absence of the test inhibitor
compound is indicative of inhibition; and (e) identifying the test compound as
an inhibitor of ligand binding if decreased ligand binding is observed, whereby
a compound that inhibits binding of a ligand to a PXR polypeptide is identified.
Preferably, the ligand is a hypocholesterolemic drug. More preferably, the
hypocholesterolemic drug is SR12813.
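
By way of a hedged illustration, the comparison in steps (d) and (e) of the assay method can be expressed as a percent inhibition. The sketch below is an assumption about how such a comparison might be scored; the function names, the `min_percent` cutoff and the example readings are hypothetical and do not come from the disclosure.

```python
def percent_inhibition(bound_without_inhibitor, bound_with_inhibitor):
    """Percent decrease in bound ligand caused by the test inhibitor compound."""
    if bound_without_inhibitor <= 0:
        raise ValueError("binding in the absence of inhibitor must be positive")
    return 100.0 * (1.0 - bound_with_inhibitor / bound_without_inhibitor)


def is_inhibitor(bound_without_inhibitor, bound_with_inhibitor, min_percent=0.0):
    """Flag a test compound when ligand binding decreases by more than min_percent.

    min_percent is a hypothetical cutoff; the text only requires that
    decreased binding be observed.
    """
    return percent_inhibition(bound_without_inhibitor, bound_with_inhibitor) > min_percent


# Hypothetical readings of bound ligand, in arbitrary units.
print(percent_inhibition(1000.0, 250.0))
print(is_inhibitor(1000.0, 250.0))
```

In practice a nonzero `min_percent` would be chosen to exceed assay noise; that choice is outside what the text specifies.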

A method of evaluating a candidate therapeutic agent in humans using
a mouse model system is disclosed. In a preferred embodiment, the method
comprises: (a) providing atomic coordinates of a human PXR ligand binding
domain to a computerized modeling system; (b) modeling a candidate
therapeutic agent that fits spatially into a binding cavity or on the surface of a
human PXR ligand binding domain; (c) providing a mouse PXR polypeptide;
(d) selecting one or more mutations to be introduced into an amino acid
sequence of the mouse PXR polypeptide, the mutations being selected so as
to alter the mouse PXR polypeptide to be similar to a human PXR
polypeptide; (e) providing a mutant mouse PXR polypeptide comprising the
one or more mutations selected in step (d); (f) contacting the candidate
therapeutic agent modeled in step (b) with the mutant mouse PXR
polypeptide; (g) determining an effect of the candidate therapeutic agent on
the mutant mouse PXR polypeptide; and (h) evaluating the potential of the
candidate therapeutic agent for use in humans based on the effect of the
candidate therapeutic agent on the mutant mouse PXR polypeptide, whereby
the candidate therapeutic agent in humans using a mouse model system is
evaluated.
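
As a sketch of steps (d) and (e), point mutations written in the notation used in the description of Figures 15A-15C (for example, R203L: wild-type residue, position, replacement residue) can be applied to a sequence in silico. The helper `apply_point_mutations` and the six-residue toy sequence below are hypothetical and for illustration only; they are not the mouse PXR sequence.

```python
import re

# Matches labels such as "R203L": wild-type residue, 1-based position, new residue.
_MUTATION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def apply_point_mutations(sequence, mutations):
    """Return a copy of sequence with each labeled point mutation applied."""
    residues = list(sequence)
    for label in mutations:
        match = _MUTATION.match(label)
        if match is None:
            raise ValueError(f"unrecognized mutation label: {label}")
        wild_type, position, replacement = match.group(1), int(match.group(2)), match.group(3)
        if not 1 <= position <= len(residues):
            raise IndexError(f"position {position} is outside the sequence")
        if residues[position - 1] != wild_type:
            raise ValueError(f"{label}: sequence has {residues[position - 1]} at {position}")
        residues[position - 1] = replacement
    return "".join(residues)


# Toy sequence and labels, for illustration only.
mutant = apply_point_mutations("MKRAPQ", ["R3L", "P5S"])
print(mutant)
```

Checking the wild-type residue before substituting guards against numbering mismatches between a fragment and the full-length protein.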

Accordingly, it is an object of the present invention to provide a
three-dimensional structure of the ligand binding domain of PXR. The object is
achieved in whole or in part by the present invention.

An object of the invention having been stated hereinabove, other
objects will be evident as the description proceeds, when taken in connection
with the accompanying Drawings and Laboratory Examples as best described
hereinbelow.

Brief Description of the Drawings

Figure 1 is a ribbon diagram depicting the structure of the ligand
binding domain of the human xenobiotic receptor PXR.

Figure 2 is a cut-away view of the ligand binding cavity of the human
xenobiotic PXR from a first perspective.

Figure 3 is a cut-away view of the ligand binding cavity of the human
xenobiotic PXR from a second perspective.

Figure 4 is an experimentally-observed position of SR12813 in the
ligand binding cavity of human PXR. In this figure, intermolecular interactions
are shown directly. Amino acid side chains are shown in blue. SR12813 is
depicted as a wireframe model and interacting atoms are presented as a
spacefilling model comprising light gray spheres.

Figure 5 is an experimentally-observed position of SR12813 in the
ligand binding cavity of human PXR. In this figure, intermolecular interactions
are shown schematically. Van der Waals contacts are indicated by solid
arrows and hydrogen bonds by dashed arrows.

Figure 6 is a second experimentally-observed position of SR12813 in
the ligand binding cavity of human PXR. In this figure, intermolecular
interactions are shown directly. Amino acid side chains are shown in blue.
SR12813 is depicted as a wireframe model and interacting atoms are
presented as a spacefilling model comprising light gray spheres.

Figure 7 is a second experimentally-observed position of SR12813 in
the ligand binding cavity of human PXR. In this figure, intermolecular
interactions are shown schematically. Van der Waals contacts are indicated
by solid arrows and hydrogen bonds by dashed arrows.

Figure 8 is a third experimentally-observed position of SR12813 in the
ligand binding cavity of human PXR. In this figure, intermolecular interactions
are shown directly. Amino acid side chains are shown in blue. SR12813 is
depicted as a wireframe model and interacting atoms are presented as a
spacefilling model comprising light gray spheres.

Figure 9 is a third experimentally-observed position of SR12813 in the
ligand binding cavity of human PXR. In this figure, intermolecular interactions
are shown schematically. Van der Waals contacts are indicated by solid
arrows and hydrogen bonds by dashed arrows.

Figure 10 is a ribbon diagram depicting two salt bridges (Glu-321 to
Arg-410 and Asp-205 to Arg-413) adjacent to the ligand binding cavity of
human PXR.

Figures 11A-11D are a series of plots representing the luciferase
(normalized luciferase activity counts per second (CPS) x 1000, y axis) and
alkaline phosphatase activity of several PXR mutants in the presence of
rifampicin or SR12813 (concentration in M, x axis). To generate these plots,
CV-1 cells were transfected with expression plasmids for hPXR, D205A, R413A,
E321A, or R410A, in addition to the XREM-CYP3A4-luciferase reporter.
○ = wildtype (WT); A = D205A mutant; « = R413A mutant.

Figure 12 is a computer-generated model depicting the
homodimerization interaction observed in human PXR. The various distances
between interacting residues are presented in the figure. The individual
monomers are denoted by blue and green coloring.

Figure 13 is a ribbon diagram depicting an overview of the
crystallographically-observed homodimer of human PXR. Individual
monomers are denoted by red and gold coloring, with green and blue coloring
representing interacting structure.

Figure 14 is a plot representing the luciferase and alkaline
phosphatase activity (normalized luciferase activity counts per second (CPS)
x 1000, y axis) of PXR mutants in the presence of rifampicin (RIF) or
SR12813 (SR) (concentration in M, x axis). To generate this plot, CV-1 cells
were transfected with expression plasmids encoding wildtype hPXR or mutant
hPXR comprising the mutations W223A, Y225A and the XREM-CYP3A4-
luciferase reporter. ○ = WT-SR; □ = W223A, Y225A-RIF; • = WT-SR; ■ =
W223A, Y225A-RIF.

Figures 15A-15C are a series of bar graphs demonstrating that four
point mutations "humanize" mouse PXR's sensitivity to ligands. CV-1 cells
were transfected with expression plasmids for mouse PXR (Fig. 15A), human
PXR (Fig. 15B), or R203L, P205S, Q404H, Q407R (mouse->human) mouse
PXR (Fig. 15C) and the XREM-CYP3A4-luciferase reporter. On the horizontal
axis, bars from left to right represent vehicle, PCN and SR12813.

Brief Description of the Sequences in the Sequence Listing

SEQ ID NO: 1 is a DNA sequence encoding a full-length human PXR
polypeptide (Swiss-Prot Accession No. O75469; GenBank Accession No.
AF061056).

SEQ ID NO: 2 is an amino acid sequence of a full-length human PXR
polypeptide and is derived from the DNA sequence of SEQ ID NO: 1 (Swiss-
Prot Accession No. O75469; GenBank Accession No. AF061056).

SEQ ID NO: 3 is a DNA sequence encoding the ligand binding domain
of human PXR. The sequence codes for residues 130-434, which
corresponds to the ligand binding domain of PXR (Swiss-Prot Accession No.
O75469; GenBank Accession No. AF061056).

SEQ ID NO: 4 is an amino acid sequence of the PXR ligand binding
domain and is derived from the DNA sequence of SEQ ID NO: 3 (Swiss-Prot
Accession No. O75469; GenBank Accession No. AF061056).

SEQ ID NO: 5 is an amino acid sequence of the loop involving residues
309-321 in hPXR (Swiss-Prot Accession No. O75469; GenBank Accession
No. AF061056).

SEQ ID NO: 6 is a DNA sequence encoding a full-length human VDR
polypeptide (Swiss-Prot Accession No. P11473; GenBank Accession No.
J03258).

SEQ ID NO: 7 is an amino acid sequence of a full-length human VDR
polypeptide and is derived from the DNA sequence of SEQ ID NO: 6 (Swiss-
Prot Accession No. P11473; GenBank Accession No. J03258).

SEQ ID NO: 8 is a DNA sequence encoding the ligand binding domain
of human VDR. The sequence codes for residues 192-427, which
corresponds to the ligand binding domain of PXR (Swiss-Prot Accession No.
P11473; GenBank Accession No. J03258).

SEQ ID NO: 9 is an amino acid sequence of the VDR ligand binding
domain and is derived from the DNA sequence of SEQ ID NO: 8 (Swiss-Prot
Accession No. P11473; GenBank Accession No. J03258).

SEQ ID NO: 10 is a DNA sequence encoding an N-terminal
polyhistidine tagged PXR ligand binding domain fusion protein comprising
residues 130-434 from human PXR (fragment from Swiss-Prot Accession No.
O75469; GenBank Accession No. AF061056).

SEQ ID NO: 11 is an amino acid sequence of the N-terminal
polyhistidine tagged fusion protein comprising residues 130-434 from human
PXR and is derived from the DNA sequence of SEQ ID NO: 10 (fragment from
Swiss-Prot Accession No. O75469; GenBank Accession No. AF061056).

SEQ ID NO: 12 is a DNA sequence encoding residues 623-710 of the
human SRC-1 gene (fragment from GenBank Accession No. U59302).

SEQ ID NO: 13 is an amino acid sequence of residues 623-710 of the
human SRC-1 gene and is derived from the DNA sequence of SEQ ID NO: 12
(fragment from GenBank Accession No. U59302).

Detailed Description of the Invention

Until disclosure of the present invention presented herein, the ability to 
obtain crystalline forms of a PXR LBD has not been realized. And until 
disclosure of the present invention presented herein, a detailed three- 
dimensional crystal structure of a PXR polypeptide has not been solved. 

In addition to providing structural information, crystalline polypeptides
provide other advantages. For example, the crystallization process itself
further purifies the polypeptide, and satisfies one of the classical criteria for
homogeneity. In fact, crystallization frequently provides unparalleled
purification quality, removing impurities that are not removed by other
purification methods such as HPLC, dialysis, conventional column
chromatography, etc. Moreover, crystalline polypeptides are often stable at
ambient temperatures and free of protease contamination and other
degradation associated with solution storage. Crystalline polypeptides can
also be useful as pharmaceutical preparations. Finally, crystallization
techniques in general are largely free of problems such as denaturation
associated with other stabilization methods (e.g., lyophilization). Once
crystallization has been accomplished, crystallographic data provides useful
structural information that can assist the design of compounds that can serve
as agonists or antagonists, as described herein below. In addition, the crystal
structure provides information useful to map a receptor-binding domain, which
could then be mimicked by a small non-peptide molecule that would serve as
an antagonist or agonist.

I. Definitions

Following long-standing patent law convention, the terms "a" and "an" 
mean "one or more" when used in this application, including the claims. 

As used herein, the term "mutation" carries its traditional connotation
and means a change, inherited, naturally occurring or introduced, in a nucleic
acid or polypeptide sequence, and is used in its sense as generally known to
those of skill in the art.

As used herein, the term "labeled" means the attachment of a moiety,
capable of detection by spectroscopic, radiologic or other methods, to a probe
molecule.

As used herein, the term "target cell" refers to a cell, into which it is
desired to insert a nucleic acid sequence or polypeptide, or to otherwise effect
a modification from conditions known to be standard in the unmodified cell. A
nucleic acid sequence introduced into a target cell can be of variable length.
Additionally, a nucleic acid sequence can enter a target cell as a component
of a plasmid or other vector or as a naked sequence.

As used herein, the term "transcription" means a cellular process
involving the interaction of an RNA polymerase with a gene that directs the
expression as RNA of the structural information present in the coding
sequences of the gene. The process includes, but is not limited to, the
following steps: (a) transcription initiation, (b) transcript elongation, (c)
transcript splicing, (d) transcript capping, (e) transcript termination, (f)
transcript polyadenylation, (g) nuclear export of the transcript, (h) transcript
editing, and (i) stabilizing the transcript.

As used herein, the term "expression" generally refers to the cellular 
processes by which a polypeptide is produced from RNA. 

As used herein, the term "transcription factor" means a cytoplasmic or
nuclear protein which binds to a gene, or binds to an RNA transcript of a
gene, or binds to another protein which binds to a gene or an RNA transcript
or another protein which in turn binds to a gene or an RNA transcript, so as to
thereby modulate expression of the gene. Such modulation can additionally
be achieved by other mechanisms; the essence of a "transcription factor for a
gene" pertains to a factor that alters the level of transcription of the gene in
some way.

As used herein, the term "hybridization" means the binding of a probe
molecule, a molecule to which a detectable moiety has been bound, to a
target sample.

As used herein, the term "detecting" means confirming the presence of
a target entity by observing the occurrence of a detectable signal, such as a
radiologic or spectroscopic signal that will appear exclusively in the presence
of the target entity.

As used herein, the term "sequencing" means determining the ordered 
linear sequence of nucleic acids or amino acids of a DNA or protein target 
sample, using conventional manual or automated laboratory techniques. 

As used herein, the term "isolated" means oligonucleotides
substantially free of other nucleic acids, proteins, lipids, carbohydrates or
other materials with which they can be associated, such association being
either in cellular material or in a synthesis medium. The term can also be
applied to polypeptides, in which case the polypeptide will be substantially
free of nucleic acids, carbohydrates, lipids and other undesired polypeptides.

As used herein, the term "substantially pure" means that the
polynucleotide or polypeptide is substantially free of the sequences and
molecules with which it is associated in its natural state, and those molecules
used in the isolation procedure. The term "substantially free" means that the
sample is at least 50%, preferably at least 70%, more preferably 80% and
most preferably 90% free of the materials and compounds with which it is
associated in nature.

As used herein, the term "primer" means a sequence comprising two or
more deoxyribonucleotides or ribonucleotides, preferably more than three,
and more preferably more than eight and most preferably at least about 20
nucleotides of an exonic or intronic region. Such oligonucleotides are
preferably between ten and thirty bases in length.

As used herein, the term "DNA segment" means a DNA molecule that
has been isolated free of total genomic DNA of a particular species. In a
preferred embodiment, a DNA segment encoding a PXR polypeptide refers to
a DNA segment that contains SEQ ID NO: 1, but can optionally comprise
fewer or additional nucleic acids, yet is isolated away from, or purified free
from, total genomic DNA of a source species, such as Homo sapiens.
Included within the term "DNA segment" are DNA segments and smaller
fragments of such segments, and also recombinant vectors, including, for
example, plasmids, cosmids, phages, viruses, and the like.

As used herein, the phrase "enhancer-promoter" means a composite
unit that contains both enhancer and promoter elements. An enhancer-
promoter is operatively linked to a coding sequence that encodes at least one
gene product.

As used herein, the phrase "operatively linked" means that an
enhancer-promoter is connected to a coding sequence in such a way that the
transcription of that coding sequence is controlled and regulated by that
enhancer-promoter. Techniques for operatively linking an enhancer-promoter
to a coding sequence are well known in the art; the precise orientation and
location relative to a coding sequence of interest is dependent, inter alia, upon
the specific nature of the enhancer-promoter.

As used herein, the terms "candidate substance" and "candidate
compound" are used interchangeably and refer to a substance that is believed
to interact with another moiety, for example a given ligand that is believed to
interact with a complete, or a fragment of, a PXR polypeptide, and which can
be subsequently evaluated for such an interaction. Representative candidate
substances or compounds include "xenobiotics", such as drugs and other
therapeutic agents, carcinogens and environmental pollutants, natural
products and extracts, as well as "endobiotics", such as steroids, fatty acids
and prostaglandins. Other examples of candidate compounds that can be
investigated using the methods of the present invention include, but are not
restricted to, agonists and antagonists of a PXR polypeptide, toxins and
venoms, viral epitopes, hormones (e.g., opioid peptides, steroids, etc.),
hormone receptors, peptides, enzymes, enzyme substrates, co-factors,
lectins, sugars, oligonucleotides or nucleic acids, oligosaccharides, proteins,
small molecules and monoclonal antibodies.

As used herein, the term "biological activity" means any observable
effect flowing from interaction between a PXR polypeptide and a ligand.
Representative, but non-limiting, examples of biological activity in the context
of the present invention include dimerization of a PXR and association of a
PXR with DNA.

As used herein, the term "modified" means an alteration from an
entity's normally occurring state. An entity can be modified by removing
discrete chemical units or by adding discrete chemical units. The term
"modified" encompasses detectable labels as well as those entities added as
aids in purification.

As used herein, the terms "structure coordinates" and "structural
coordinates" mean mathematical coordinates derived from mathematical
equations related to the patterns obtained on diffraction of a monochromatic
beam of X-rays by the atoms (scattering centers) of a molecule in crystal
form. The diffraction data are used to calculate an electron density map of the
repeating unit of the crystal. The electron density maps are used to establish
the positions of the individual atoms within the unit cell of the crystal.

Those of skill in the art understand that a set of structure coordinates
determined by X-ray crystallography is not without standard error. For the
purpose of this invention, any set of structure coordinates for PXR or a PXR
mutant that have a root mean square deviation (RMSD) from ideal of
preferably no more than 1.5 Å, more preferably no more than 1.0 Å, and most
preferably no more than 0.5 Å when superimposed, using the polypeptide
backbone atoms, on the structure coordinates listed in Table 4 shall be
considered identical.
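
The RMSD criterion above can be sketched in a few lines. This is a minimal illustration under a stated assumption: the two sets of backbone atom coordinates have already been superimposed and are listed in the same atom order. The function names and the tier labels are hypothetical; only the 1.5, 1.0 and 0.5 angstrom thresholds come from the text.

```python
import math


def backbone_rmsd(coords_a, coords_b):
    """Root mean square deviation between two pre-superimposed coordinate sets.

    Each argument is a sequence of (x, y, z) tuples in angstroms, paired
    atom by atom. No superposition is performed here.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate sets must be non-empty and of equal length")
    total = 0.0
    for (x1, y1, z1), (x2, y2, z2) in zip(coords_a, coords_b):
        total += (x1 - x2) ** 2 + (y1 - y2) ** 2 + (z1 - z2) ** 2
    return math.sqrt(total / len(coords_a))


def identity_tier(rmsd):
    """Map an RMSD in angstroms onto the preference tiers recited in the text."""
    if rmsd <= 0.5:
        return "most preferred"
    if rmsd <= 1.0:
        return "more preferred"
    if rmsd <= 1.5:
        return "preferred"
    return "not identical"


# Hypothetical two-atom example: every atom is displaced by 1 angstrom along z.
reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
candidate = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]
print(identity_tier(backbone_rmsd(reference, candidate)))
```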

As used herein, the term "space group" means the arrangement of 
symmetry elements of a crystal. 

As used herein, the term "molecular replacement" means a method
that involves generating a preliminary model of the wild-type PXR ligand
binding domain, or a PXR mutant crystal whose structure coordinates are
unknown, by orienting and positioning a molecule whose structure
coordinates are known within the unit cell of the unknown crystal so as best to
account for the observed diffraction pattern of the unknown crystal. Phases
can then be calculated from this model and combined with the observed
amplitudes to give an approximate Fourier synthesis of the structure whose
coordinates are unknown. This, in turn, can be subject to any of the several
forms of refinement to provide a final, accurate structure of the unknown
crystal. (See, e.g., Lattman, (1985) Methods Enzymol., 115: 55-77; Rossmann,
ed., (1972) The Molecular Replacement Method, Gordon & Breach, New
York.) Using the structure coordinates of the ligand binding domain of hPXR
provided by this invention, molecular replacement can be used to determine
the structure coordinates of a crystalline mutant or homologue of the hPXR
ligand binding domain, or of a different crystal form of the hPXR ligand binding
domain.
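
The combination of model phases with observed amplitudes described above corresponds to the standard Fourier synthesis of electron density. As a sketch, using symbols that are assumptions introduced here rather than notation from the source (V is the unit cell volume, |F_obs(hkl)| the observed structure factor amplitude, φ_calc(hkl) the phase calculated from the positioned model, and (x, y, z) fractional coordinates in the unit cell):

```latex
\rho(x, y, z) \approx \frac{1}{V} \sum_{h,k,l}
  \left| F_{\mathrm{obs}}(hkl) \right|
  \, e^{\, i \varphi_{\mathrm{calc}}(hkl)}
  \, e^{-2\pi i \,(hx + ky + lz)}
```

The synthesis is approximate because the phases come from the search model rather than from the unknown structure itself, which is why refinement follows.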

As used herein, the term "isomorphous replacement" means a method
of using heavy atom derivative crystals to obtain the phase information
necessary to elucidate the three-dimensional structure of a native crystal
(Blundell et al., (1976) Protein Crystallography, Academic Press; Otwinowski,
(1991), in Isomorphous Replacement and Anomalous Scattering, (Evans &
Leslie, eds.), 80-86, Daresbury Laboratory, Daresbury, United Kingdom). The
phrase "heavy-atom derivatization" is synonymous with the term
"isomorphous replacement".

As used herein, the terms "β-sheet" and "beta-sheet" mean the
conformation of a polypeptide chain stretched into an extended zigzag
conformation. Portions of polypeptide chains that run "parallel" all run in the



WO 02/095652 PCT/US02/15701

same direction. Polypeptide chains that are "antiparallel" run in the opposite 
direction from the parallel chains. 

As used herein, the terms "α-helix" and "alpha-helix" mean the
conformation of a polypeptide chain wherein the polypeptide backbone is
wound around the long axis of the molecule in a left-handed or right-handed
direction, and the R groups of the amino acids protrude outward from the
helical backbone, wherein the repeating unit of the structure is a single turn of
the helix, which extends about 0.56 nm along the long axis.

As used herein, the term "unit cell" means a basic parallelepiped
shaped block. The entire volume of a crystal can be constructed by regular
assembly of such blocks. Each unit cell comprises a complete representation
of the unit of pattern, the repetition of which builds up the crystal. Thus, the
term "unit cell" means the fundamental portion of a crystal structure that is
repeated infinitely by translation in three dimensions. A unit cell is
characterized by three vectors a, b, and c, not located in one plane, which
form the edges of a parallelepiped. Angles α, β and γ define the angles
between the vectors: angle α is the angle between vectors b and c; angle β is
the angle between vectors a and c; and angle γ is the angle between vectors
a and b. The entire volume of a crystal can be constructed by regular
assembly of unit cells; each unit cell comprises a complete representation of
the unit of pattern, the repetition of which builds up the crystal.

As used herein, the term "tetragonal unit cell" means a unit cell wherein
a = b ≠ c; and α = β = γ = 90°. The vectors a, b and c describe the unit cell
edges and the angles α, β, and γ describe the unit cell angles.
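
The tetragonal condition defined above lends itself to a simple check. The sketch below is illustrative only; the function name `is_tetragonal` and the tolerance `tol` are assumptions, since measured cell parameters are never exactly equal and the text does not specify a tolerance.

```python
import math


def is_tetragonal(a, b, c, alpha, beta, gamma, tol=1e-3):
    """Return True when a = b, a differs from c, and all angles are 90 degrees.

    Lengths are in angstroms and angles in degrees; tol is a hypothetical
    absolute tolerance applied to every comparison.
    """
    right_angles = all(
        math.isclose(angle, 90.0, abs_tol=tol) for angle in (alpha, beta, gamma)
    )
    equal_ab = math.isclose(a, b, abs_tol=tol)
    distinct_c = not math.isclose(a, c, abs_tol=tol)
    return right_angles and equal_ab and distinct_c


# Lattice constants recited in the Summary of the Invention.
print(is_tetragonal(91.6, 91.6, 85.0, 90.0, 90.0, 90.0))
```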

As used herein, the term "crystal lattice" means the array of points
defined by the vertices of packed unit cells.

As used herein, the terms "ligand binding site" and "ligand binding
domain" are used interchangeably and mean that site in a polypeptide where
substrate binding occurs. For hPXR, the ligand binding domain comprises
residues 130-434 of the full-length human PXR protein.
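
Residue ranges such as 130-434 are 1-based and inclusive, which is a common source of off-by-one errors when a domain is excised from a full-length sequence. The following minimal sketch shows the conversion; the helper `extract_region` is hypothetical, and the placeholder string merely stands in for a full-length sequence and is not SEQ ID NO: 2.

```python
def extract_region(sequence, first_residue, last_residue):
    """Return residues first_residue..last_residue (1-based, inclusive)."""
    if not 1 <= first_residue <= last_residue <= len(sequence):
        raise ValueError("residue range lies outside the sequence")
    return sequence[first_residue - 1:last_residue]


LBD_FIRST, LBD_LAST = 130, 434  # ligand binding domain range stated in the text

# Placeholder of the right length only; a real sequence would be used in practice.
placeholder_sequence = "A" * 434
lbd = extract_region(placeholder_sequence, LBD_FIRST, LBD_LAST)
print(len(lbd))
```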

As used herein, the term "PXR" means nucleic acids encoding a
pregnane X receptor (PXR) nuclear receptor polypeptide that can bind DNA
and/or one or more ligands, and/or has the ability to form multimers. The term
"PXR" includes invertebrate homologs; however, preferably, PXR nucleic
acids and polypeptides are isolated from vertebrate sources. "PXR" further
includes vertebrate homologs of PXR family members, including, but not
limited to, mammalian and avian homologs. Representative mammalian
homologs of PXR family members include, but are not limited to, murine and
human homologs.

As used herein, the terms "PXR gene product", "PXR protein", "PXR
polypeptide", and "PXR peptide" are used interchangeably and mean peptides
having amino acid sequences which are substantially identical to native amino
acid sequences from an organism of interest and which are biologically active
in that they comprise all or a part of the amino acid sequence of a PXR
polypeptide, or cross-react with antibodies raised against a PXR polypeptide,
or retain all or some of the biological activity (e.g., DNA or ligand binding
ability and/or dimerization ability) of the native amino acid sequence or
protein. Such biological activity can include immunogenicity.

As used herein, the terms "PXR gene product", "PXR protein", "PXR
polypeptide", and "PXR peptide" also include analogs of a PXR polypeptide.
By "analog" is intended that a DNA or peptide sequence can contain
alterations relative to the sequences disclosed herein, yet retain all or some of
the biological activity of those sequences. Analogs can be derived from
genomic nucleotide sequences as are disclosed herein or from other
organisms, or can be created synthetically. Those skilled in the art will
appreciate that other analogs, as yet undisclosed or undiscovered, can be
used to design and/or construct PXR analogs. There is no need for a "PXR
gene product", "PXR protein", "PXR polypeptide", or "PXR peptide" to
comprise all or substantially all of the amino acid sequence of a PXR
polypeptide gene product. Shorter or longer sequences are anticipated to be
of use in the invention; shorter sequences are herein referred to as
"segments". Thus, the terms "PXR gene product", "PXR protein", "PXR
polypeptide", and "PXR peptide" also include fusion, chimeric or recombinant
PXR polypeptides and proteins comprising sequences of the present
invention. Methods of preparing such proteins are disclosed herein and are
known in the art.

As used herein, the term "polypeptide" means any polymer comprising
any of the 20 protein amino acids, regardless of its size. Although "protein" is
often used in reference to relatively large polypeptides, and "peptide" is often
used in reference to small polypeptides, usage of these terms in the art
overlaps and varies. The term "polypeptide" as used herein refers to
peptides, polypeptides and proteins, unless otherwise noted. As used herein,
the terms "protein", "polypeptide" and "peptide" are used interchangeably
when referring to a gene product.

As used herein, the term "modulate" means an increase, decrease, or 
other alteration of any, or all, chemical and biological activities or properties of 
a wild-type or mutant PXR polypeptide. The term "modulation" as used herein
refers to both upregulation (i.e., activation or stimulation) and downregulation
(i.e., inhibition or suppression) of a response.

As used herein, the terms "PXR gene" and "recombinant PXR gene" 
mean a nucleic acid molecule comprising an open reading frame encoding a 
PXR polypeptide of the present invention, including both exon and (optionally) 
intron sequences. 

As used herein, the term "gene" is used for simplicity to refer to a
functional protein, polypeptide or peptide encoding unit. As will be understood
by those in the art, this functional term includes both genomic sequences and 
cDNA sequences. Preferred embodiments of genomic and cDNA sequences 
are disclosed herein. 

As used herein, the term "DNA sequence encoding a PXR polypeptide"
can refer to one or more coding sequences within a particular individual.
Moreover, certain differences in nucleotide sequence can exist between
individual organisms; such variant sequences are called alleles. Allelic
differences might or might not result in differences in the amino acid sequence
of the encoded polypeptide, yet the alleles still encode a protein with the same
biological activity. As is well known, genes for a particular polypeptide can
exist in single or multiple copies within the genome of an individual. Such
duplicate genes can be identical or can have certain modifications, including
nucleotide substitutions, additions or deletions, all of which still code for
polypeptides having substantially the same activity.

As used herein, the term "intron" means a DNA sequence present in a
given gene that is not translated into protein.

As used herein, the term "interact" means detectable interactions 
between molecules, such as can be detected using, for example, a yeast two 
hybrid assay. The term "interact" is also meant to include "binding" 
interactions between molecules. Interactions can, for example, be
protein-protein or protein-nucleic acid in nature.

As used herein, the terms "cells," "host cells" and "recombinant host
cells" are used interchangeably and refer not only to the particular subject
cell, but also to the progeny or potential progeny of such a cell. Because
certain modifications can occur in succeeding generations due to either
mutation or environmental influences, such progeny might not, in fact, be
identical to the parent cell, but are still included within the scope of the term as
used herein.

As used herein, the term "agonist" means an agent that supplements
or potentiates the bioactivity of a functional PXR gene or protein, of a
polypeptide encoded by a gene that is up- or down-regulated by a PXR
polypeptide, and/or of a polypeptide encoded by a gene that contains a PXR
binding site in its promoter region.

As used herein, the term "antagonist" means an agent that decreases
or inhibits the bioactivity of a functional PXR gene or protein, or that
supplements or potentiates the bioactivity of a naturally occurring or
engineered non-functional PXR gene or protein. Alternatively, an antagonist
can decrease or inhibit the bioactivity of a functional gene or polypeptide
encoded by a gene that is up- or down-regulated by a PXR polypeptide and/or
contains a PXR binding site in its promoter region. An antagonist can also
supplement or potentiate the bioactivity of a naturally occurring or engineered
non-functional gene or polypeptide encoded by a gene that is up- or
down-regulated by a PXR polypeptide, and/or contains a PXR binding site in its
promoter region.

As used herein, the terms "chimeric protein" and "fusion protein" are
used interchangeably and mean a fusion of a first amino acid sequence
encoding a PXR polypeptide with a second amino acid sequence defining a
polypeptide domain foreign to, and not homologous with, any domain of a
PXR polypeptide. A chimeric protein can present a foreign domain that is
found in an organism that also expresses the first protein, or it can be an
"interspecies" or "intergenic" fusion of protein structures expressed by
different kinds of organisms. In general, a fusion protein can be represented
by the general formula X-PXR-Y, wherein PXR represents a portion of the
protein which is derived from a PXR polypeptide, and X and Y are
independently absent or represent amino acid sequences which are not
related to a PXR sequence in an organism, which includes naturally occurring
mutants. The term "chimeric gene" refers to a nucleic acid construct that
encodes a "chimeric protein" or "fusion protein" as defined herein.

As used herein, the term "therapeutic agent" is a chemical entity 
intended to effectuate a change in an organism. Preferably, but not 
necessarily, the organism is a human being. It is not necessary that a
therapeutic agent be known to effectuate a change in an organism; chemical
entities that are suspected, predicted or designed to effectuate a change in an 
organism are therefore encompassed by the term "therapeutic agent." The 
effectuated change can be of any kind, observable or unobservable, and can 
include, for example, a change in the biological activity of a protein. 

Representative therapeutic compounds include small molecules,
proteins and peptides, oligonucleotides of any length, "xenobiotics", such as
drugs and other therapeutic agents, carcinogens and environmental
pollutants, natural products and extracts, as well as "endobiotics", such as
steroids, fatty acids and prostaglandins. Other examples of therapeutic
agents can include, but are not restricted to, agonists and antagonists of a
PXR polypeptide, toxins and venoms, viral epitopes, hormones (e.g., opioid
peptides, steroids, etc.), hormone receptors, peptides, enzymes, enzyme
substrates, co-factors, lectins, sugars, oligonucleotides or nucleic acids,
oligosaccharides, proteins, small molecules and monoclonal antibodies.

II. Description of Tables

Table 1 is a table summarizing the crystal and data statistics obtained
from the crystallized ligand binding domain of human PXR. Data on the unit
cell are presented, including data on the crystal space group, unit cell
dimensions, molecules per asymmetric unit and crystal resolution.

Table 2 is a table comparing human PXR-LBD with nuclear receptor
LBDs of known structure.

Table 3 is a table summarizing the calculated buried solvent-accessible
surface area of amino acid residues that line the ligand binding cavity of
human PXR.

Table 4 is a table of the atomic structure coordinate data obtained from
X-ray diffraction from the ligand binding domain of human PXR in complex
with a ligand.

Table 5 is a table of the atomic structure coordinate data obtained from 
X-ray diffraction from human vitamin D receptor that was used in the 
molecular replacement solution of the human PXR ligand binding domain. 

III. General Considerations

Expression of the CYP3A genes is induced at the level of transcription
by a variety of xenobiotics, including many that are metabolized by CYP3A.
This transcriptional regulation of CYP3A expression provides a mechanism for
amplifying the physiologic response during periods of prolonged xenobiotic
challenge. The induction of CYP3A expression also represents the basis for
an important class of drug-drug interactions. Recently, a member of the
nuclear receptor family of ligand-activated transcription factors, the pregnane X
receptor (PXR), was shown to serve as a key regulator of CYP3A gene
expression. The human ortholog of PXR is alternately referred to as the
pregnane-activated receptor (PAR) or the steroid and xenobiotic receptor
(SXR). Like other nuclear receptors, PXR contains a DNA binding domain
(DBD) and a ligand binding domain (LBD). PXR binds to xenobiotic response
elements in the regulatory regions of CYP3A genes as a heterodimer with the
9-cis retinoic acid receptor (RXR). Notably, PXR is activated by most of the
xenobiotics that are known to induce CYP3A gene expression, including
commonly used drugs such as the antibiotic rifampicin and the glucocorticoid
dexamethasone. Certain endogenous compounds also activate PXR,
including the toxic bile acid lithocholic acid and certain C21 steroids
(pregnanes). Thus, unlike the steroid, retinoid and thyroid hormone receptors,
which are highly selective for their cognate hormones, PXR has evolved to
detect structurally diverse substrates. Interestingly, there are marked
differences in the pharmacologic activation profile of PXR across species. For
instance, human PXR is activated efficiently by rifampicin and the
hypocholesterolemic drug SR12813, whereas mouse PXR is not. By contrast,
mouse PXR is activated by the synthetic steroid pregnenolone
16α-carbonitrile (PCN), whereas the human receptor is not. These data suggest
that PXR has evolved to detect the different xenobiotic challenges faced by
different species.

The present invention will usually be applicable mutatis mutandis to all
PXR polypeptides, as discussed herein, based in part on the patterns of PXR
structure and modulation that have emerged as a consequence of
determining the three dimensional structure of human PXR in complex with a
ligand. Generally speaking, PXR homologs and orthologs display substantial
regions of amino acid homology. Additionally, the PXRs display an overall
structural motif comprising three modular domains:

1) a variable amino-terminal domain;

2) a highly conserved DNA-binding domain (DBD); and

3) a less conserved carboxy-terminal ligand binding domain (LBD).

The modularity of PXR permits different domains of each protein to separately
accomplish different functions, although the domains can influence each
other. The separate function of a domain is usually preserved when a
particular domain is isolated from the remainder of the protein. Using
conventional protein chemistry techniques, a modular domain can sometimes
be separated from the parent protein. Using conventional molecular biology
techniques, each domain can usually be separately expressed with its original
function intact or, as discussed herein below, chimeric proteins comprising
two different proteins can be constructed, wherein the chimeric proteins retain
the properties of the individual functional domains of the respective
polypeptides from which the chimeric proteins were generated.

The amino terminal domain of PXR is the least conserved of the three 
domains. This domain is involved in transcriptional activation and, in some 
cases, its uniqueness can dictate selective receptor-DNA binding and
activation of target genes by PXR.

The DBD is the most conserved structure in PXR. It typically contains
about 70 amino acids that fold into two zinc finger motifs, wherein a zinc ion
coordinates four cysteines. The DBD generally contains two perpendicularly
oriented α-helices that extend from the base of the first and second zinc
fingers. The two zinc fingers function in concert along with non-zinc finger
residues to direct the PXRs to specific target sites on DNA. Various amino
acids in the DBD influence spacing between two half-sites (each of which
usually comprises six nucleotides) for receptor homodimerization. The optimal
spacings facilitate cooperative interactions between DBDs, and D box
residues are part of the dimerization interface. Other regions of the DBD
facilitate DNA-protein and protein-protein interactions required for PXR-RXR
heterodimerization.

The LBD is the second most highly conserved domain in these
receptors. Whereas the integrity of several different LBD sub-domains is
important for ligand binding, truncated molecules containing only the LBD can
retain normal ligand binding activity. This domain also participates in other
functions, including dimerization, nuclear translocation and transcriptional
regulation activities. Importantly, this domain can bind a ligand and can
undergo ligand-induced conformational changes. Ligand binding allows the
activation domain to serve as an interaction site for essential co-activator
proteins that function to stimulate or inhibit transcription.

The carboxy-terminal activation subdomain is in close
three-dimensional proximity in the LBD to the ligand, so as to allow for ligands
bound to the LBD to coordinate (or interact) with amino acid(s) in the
activation subdomain. As disclosed herein, the LBD of PXR is expressed,
crystallized and its three dimensional structure determined. Computational
and other methods for the design of ligands to the LBD are also disclosed.

IV. Production of PXR Polypeptides 

The native and mutated PXR polypeptides, and fragments thereof, of
the present invention can be chemically synthesized in whole or part using
techniques that are well-known in the art (See, e.g., Creighton, (1983)
Proteins: Structures and Molecular Principles, W.H. Freeman & Co., New
York, incorporated herein in its entirety). Alternatively, methods that are well
known to those skilled in the art can be used to construct expression vectors
containing a partial or the entire native or mutated PXR polypeptide coding
sequence and appropriate transcriptional/translational control signals. These
methods include in vitro recombinant DNA techniques, synthetic techniques
and in vivo recombination/genetic recombination. See, for example, the
techniques described in Sambrook et al., (1989) Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Laboratory, New York, and Ausubel
et al., (1989) Current Protocols in Molecular Biology, Greene Publishing
Associates and Wiley Interscience, New York, both incorporated herein in
their entirety.

A variety of host-expression vector systems can be utilized to express
a PXR coding sequence. These include but are not limited to microorganisms
such as bacteria transformed with recombinant bacteriophage DNA, plasmid
DNA or cosmid DNA expression vectors containing a PXR coding sequence;
yeast transformed with recombinant yeast expression vectors containing a
PXR coding sequence; insect cell systems infected with recombinant virus
expression vectors (e.g., baculovirus) containing a PXR coding sequence;
plant cell systems infected with recombinant virus expression vectors (e.g.,
cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or transformed
with recombinant plasmid expression vectors (e.g., Ti plasmid) containing a
PXR coding sequence; or animal cell systems. The expression elements of
these systems vary in their strength and specificities.

Depending on the host/vector system utilized, any of a number of
suitable transcription and translation elements, including constitutive and
inducible promoters, can be used in the expression vector. For example,
when cloning in bacterial systems, inducible promoters such as pL of
bacteriophage λ, plac, ptrp, ptac (ptrp-lac hybrid promoter) and the like can be
used. When cloning in insect cell systems, promoters such as the baculovirus
polyhedrin promoter can be used. When cloning in plant cell systems,
promoters derived from the genome of plant cells (e.g., heat shock
promoters; the promoter for the small subunit of RUBISCO; the promoter for
the chlorophyll a/b binding protein) or from plant viruses (e.g., the 35S RNA
promoter of CaMV; the coat protein promoter of TMV) can be used. When
cloning in mammalian cell systems, promoters derived from the genome of
mammalian cells (e.g., metallothionein promoter) or from mammalian viruses
(e.g., the adenovirus late promoter; the vaccinia virus 7.5K promoter) can be
used. When generating cell lines that contain multiple copies of the tyrosine
kinase domain DNA, SV40-, BPV- and EBV-based vectors can be used with
an appropriate selectable marker.

V. Formation of PXR Ligand Binding Domain Crystals 

In one embodiment, the present invention provides crystals of PXR.
The crystals were obtained using the methodology disclosed in the Laboratory
Examples. The PXR crystals, which can be native crystals, derivative crystals
or co-crystals, have tetragonal unit cells (a tetragonal unit cell is a unit cell
wherein a = b ≠ c, and wherein α = β = γ = 90°) and space group symmetry
P4₃2₁2. There is one PXR molecule in the asymmetric unit. In the PXR
crystalline form, the unit cell has dimensions of a = b = 91.6 Å, c = 85.0 Å, and
α = β = γ = 90°.
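
As a quick arithmetic check on the unit cell described above, the following Python sketch computes the volume of a cell whose angles are all 90° from the stated edge lengths, and from it the volume available per molecule. The function name is hypothetical, and the count of eight symmetry-equivalent positions for the stated space group is an assumption supplied here for illustration rather than a value taken from Table 1.

```python
def orthogonal_cell_volume(a: float, b: float, c: float) -> float:
    """Volume (cubic angstroms) of a unit cell whose angles are all 90 degrees."""
    return a * b * c


# Edge lengths stated in the text, in angstroms.
A_EDGE = 91.6
B_EDGE = 91.6
C_EDGE = 85.0

# Assumption (not from the text): eight symmetry-equivalent general
# positions per unit cell for the stated space group.
GENERAL_POSITIONS = 8
# Stated in the text: one PXR molecule in the asymmetric unit.
MOLECULES_PER_ASYMMETRIC_UNIT = 1

cell_volume = orthogonal_cell_volume(A_EDGE, B_EDGE, C_EDGE)
molecules_per_cell = GENERAL_POSITIONS * MOLECULES_PER_ASYMMETRIC_UNIT
volume_per_molecule = cell_volume / molecules_per_cell

print(f"cell volume: {cell_volume:.0f} cubic angstroms")
print(f"molecules per cell: {molecules_per_cell}")
print(f"volume per molecule: {volume_per_molecule:.0f} cubic angstroms")
```

Because the cell is tetragonal, the general triclinic volume formula reduces to the simple product of the three edge lengths used here.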

The structures of the apo and SR12813-bound forms of PXR were
solved by molecular replacement using the structure of the human vitamin D
receptor as a search model (Rochel et al., (2000) Mol. Cell 5: 173-79, PDB ID:
1DB1; GenBank Accession No. XM007046; available online at
http://www.rcsb.org/pdb/). The apo form was refined to a resolution of about
2.5 Å. The ligand-bound form was refined to a resolution of about 2.75 Å.

The heavy atom derivatized form of the PXR LBD-ligand structure can
be solved using single isomorphous replacement anomalous scattering
(SIRAS) techniques and/or multiwavelength anomalous diffraction (MAD)
techniques. In the SIRAS method of solving protein crystals, a derivative
crystal is prepared that contains an atom that is heavier than the other atoms
of the sample. Heavy atom derivative crystals are commonly prepared by
soaking a crystal in a solution containing a selected heavy atom salt. For
example, some heavy atom derivative crystals have been prepared by
soaking a crystalline form of the protein of interest in a solution of methyl
mercury chloride (MeHgCl).

Another representative heavy atom that can be incorporated into a
derivative crystal is iodine. Heavy atoms can associate with the protein of
interest, or can be localized in a ligand that associates with a protein of
interest. In the present invention, the latter approach was taken. Specifically,
an iodine-containing form of the ligand SR12813 was co-crystallized with the
PXR LBD and was used to assist in the positioning of the ligand relative to the
PXR LBD.

Analysis of derivative crystals takes advantage of differences in the
reflections from the derivative crystal as compared to the underivatized
crystal. Symmetry-related reflections in the X-ray diffraction pattern, which
are usually identical, are altered by the anomalous scattering contribution of
the heavy atoms. The measured differences in symmetry-related reflections
are used to determine the position of the heavy atoms, leading to an initial
estimation of the diffraction phases, and subsequently, an electron density
map is prepared. The prepared electron density map is then used to identify
the position of the other atoms in the sample.

V.A. Preparation of PXR Crystals

The native and derivative co-crystals, and fragments thereof, disclosed
in the present invention can be obtained by a variety of techniques, including
batch, liquid bridge, dialysis, vapor diffusion and hanging drop methods (See,
e.g., McPherson, (1982) Preparation and Analysis of Protein Crystals, John
Wiley, New York; McPherson, (1990) Eur. J. Biochem. 189:1-23; Weber,
(1991) Adv. Protein Chem. 41:1-36). In a preferred embodiment, the vapor
diffusion and hanging drop methods are used for the crystallization of PXR
polypeptides and fragments thereof.

In general, native crystals of the present invention are grown by
dissolving substantially pure PXR LBD polypeptide or a fragment thereof in an
aqueous buffer containing a precipitant at a concentration just below that 
necessary to precipitate the protein. Water is removed by controlled 
evaporation to produce precipitating conditions, which are maintained until 
crystal growth ceases. 

In a preferred embodiment of the invention, native crystals are grown
by vapor diffusion (See, e.g., McPherson, (1982) Preparation and Analysis of
Protein Crystals, John Wiley, New York; McPherson, (1990) Eur. J. Biochem.
189:1-23). In this method, the polypeptide/precipitant solution is allowed to
equilibrate in a closed container with a larger aqueous reservoir having a
precipitant concentration optimal for producing crystals. Generally, less than
about 25 µL of PXR LBD polypeptide solution is mixed with an equal volume
of reservoir solution, giving a precipitant concentration about half that required
for crystallization. This solution is suspended as a droplet underneath a
coverslip, which is sealed onto the top of the reservoir. The sealed container
is allowed to stand until crystals grow. Crystals generally form within two to
six weeks, and are suitable for data collection within approximately seven to
ten weeks. Of course, those of skill in the art will recognize that the
above-described crystallization procedures and conditions can be varied.
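
The halving of the precipitant concentration on mixing equal volumes follows from a simple dilution calculation, sketched below in Python. The function name, the 20% reservoir concentration and the 2 µL volumes are hypothetical illustration values, and the sketch assumes the polypeptide solution itself contains no precipitant.

```python
def mixed_concentration(stock_conc: float, stock_vol: float, diluent_vol: float) -> float:
    """Concentration of a component after mixing its stock with a diluent
    that contains none of that component (same units in, same units out)."""
    return stock_conc * stock_vol / (stock_vol + diluent_vol)


# Hypothetical illustration values, not taken from the disclosure.
reservoir_precipitant = 20.0  # percent (w/v) precipitant in the reservoir
reservoir_volume_ul = 2.0     # microliters of reservoir solution in the drop
protein_volume_ul = 2.0       # microliters of polypeptide solution in the drop

# Assumption: the polypeptide solution carries no precipitant of its own.
drop_precipitant = mixed_concentration(
    reservoir_precipitant, reservoir_volume_ul, protein_volume_ul
)

print(f"initial drop precipitant: {drop_precipitant} percent")
```

During equilibration, water leaves the drop for the reservoir, so the drop concentration rises from this starting value toward that of the reservoir.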



V.B. Preparation of Derivative Crystals

Derivative crystals of the present invention, e.g., heavy atom derivative
crystals, can be obtained by soaking native crystals in mother liquor
containing salts of heavy metal atoms. Alternatively, a ligand comprising a
heavy atom can be associated with a protein, and subsequently
co-crystallized. Such derivative crystals are useful for phase analysis in the
solution of crystals of the present invention. This mechanism provides
derivative crystals suitable for use as isomorphous replacements in
determining the X-ray crystal structure of a PXR polypeptide. Additional
reagents useful for the preparation of the derivative crystals of the present
invention will be apparent to those of skill in the art after review of the
disclosure of the present invention presented herein.



V.C. Preparation of Co-crystals 

Co-crystals of the present invention can be obtained by soaking a
native crystal in mother liquor containing compounds known or predicted to
bind the LBD of a PXR, or a fragment thereof. Alternatively, co-crystals can
be obtained by co-crystallizing a PXR LBD polypeptide or a fragment thereof
in the presence of one or more compounds known or predicted to bind the
polypeptide. In a preferred embodiment of the present invention, for example,
the ligand SR12813, or an iodinated form of the ligand SR12813, is
co-crystallized with a PXR-SRC-1 complex.



V.D. Solving a Crystal Structure of the Present Invention 
Crystal structures of the present invention can be solved using a
variety of techniques including, but not limited to, isomorphous replacement
anomalous scattering or molecular replacement methods. Computer software
packages will also be helpful in solving a crystal structure of the present
invention. Applicable software packages include but are not limited to the
X-PLOR™ program (Brunger, (1992) X-PLOR, Version 3.1. A System for X-ray
Crystallography and NMR, Yale University Press, New Haven, Connecticut;
X-PLOR is available from Molecular Simulations, Inc., San Diego, California),
XtalView (McRee, (1992) J. Mol. Graphics 10: 44-47; XtalView is available
from the San Diego Supercomputer Center), SHELXS 97 (Sheldrick, (1990)
Acta Cryst. A46: 467; SHELX 97 is available from the Institute of Inorganic
Chemistry, Georg-August-Universität, Göttingen, Germany), HEAVY
(Terwilliger, Los Alamos National Laboratory) and SHAKE-AND-BAKE
(Hauptman, (1997) Curr. Opin. Struct. Biol. 7: 672-80; Weeks et al.,
(1993) Acta Cryst. D49: 179; available from the Hauptman-Woodward Medical
Research Institute, Buffalo, New York). See also Ducruix & Giegé, (1992)
Crystallization of Nucleic Acids and Proteins: A Practical Approach, IRL
Press, Oxford, England, and references cited therein.

VI. Summary of Results for the PXR Ligand Binding Domain

Two structures of the LBD of human PXR are disclosed in the present
invention: first, a 2.5 Å apo structure and second, a 2.75 Å structure in
complex with the high affinity ligand SR12813. More specifically, a 1:1
complex, comprising residues 130-434 of human PXR and residues 623-710
of the human transcriptional co-activator protein SRC-1, was incubated in the
crystallization mixture. The mixture was crystallized. The 88-amino acid
co-activator peptide SRC-1 was not observed in the crystal structure, and the
examination of extensively washed and dissolved crystals by SDS-PAGE
revealed that this co-activator fragment was not present in the crystals of the
PXR LBD. Residues 142-177 and 198-431 of human PXR LBD were traced
in the structures reported in an aspect of the present invention, while ordered
electron density of the remaining amino acids, including the 20-residue stretch
from 178-197, was not observed at any time during refinement.
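
The residue bookkeeping in the paragraph above can be checked with a few lines of Python. This is only an illustrative sketch: the helper names are hypothetical, and the only inputs are the residue ranges stated in the text (the 130-434 construct and the traced segments 142-177 and 198-431).

```python
def span_length(first: int, last: int) -> int:
    """Number of residues in an inclusive residue range."""
    return last - first + 1


def untraced_spans(construct, traced):
    """Inclusive residue ranges of the construct not covered by traced segments."""
    gaps = []
    cursor = construct[0]
    for first, last in sorted(traced):
        if first > cursor:
            gaps.append((cursor, first - 1))
        cursor = last + 1
    if cursor <= construct[1]:
        gaps.append((cursor, construct[1]))
    return gaps


# Ranges stated in the text.
CONSTRUCT = (130, 434)
TRACED = [(142, 177), (198, 431)]

construct_len = span_length(*CONSTRUCT)
traced_len = sum(span_length(first, last) for first, last in TRACED)
gaps = untraced_spans(CONSTRUCT, TRACED)

print(f"construct: {construct_len} residues, traced: {traced_len} residues")
for first, last in gaps:
    print(f"untraced {first}-{last}: {span_length(first, last)} residues")
```

The internal gap found this way, 178-197, is the 20-residue stretch mentioned in the text; the other two gaps are the termini of the construct.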

The hydrophobic ligand binding cavity of human PXR contains a small
number of polar residues, permitting the ligand SR12813 to bind in three
distinct orientations. Despite the promiscuity of this receptor, human and
mouse PXR are activated by different collections of xenobiotics. Critical
residues responsible for this directed promiscuity are disclosed in the present
invention. Interestingly, the mutation of only four residues "humanizes"
murine PXR. That is, the mutation of four residues is observed to change the
binding profile of murine PXR from a murine profile to that of the human PXR.
The crystal structures of the present invention disclose important insights into
how human PXR detects xenobiotics and thus can be useful in predicting and
avoiding drug-drug interactions.

VI.A. Overall Structure of the Human PXR Ligand Binding Domain

Two structures of the ligand binding domain (LBD) of hPXR were
determined by molecular replacement (Navaza & Saludjian, (1997) Methods
Enzymol. 276A: 581-94) and were refined with CNS (Brunger et al., (1998)
Acta Crystallogr. D 54: 905-21). As noted herein above, the solved structures
disclosed in the present invention include the human PXR LBD apo structure,
which was solved to a resolution of 2.5 Å, and the structure of human PXR in
complex with the high-affinity (Kd = 41 nM) ligand SR12813, which was solved
to a resolution of 2.75 Å. The apo structure was refined to final R and Rfree
values of 0.209 and 0.284, respectively. The ligand-bound structure
(PXR-SR12813) was refined to R and Rfree values of 0.213 and 0.274,
respectively. These and other calculated and observed crystallographic data
are summarized in Table 1. Residues 142-177 and 198-431 of the human PXR
LBD were traced in the structures reported in the present invention.

The human PXR LBD is an "α-helical sandwich" comprising three
layers: α1/α3, α7/α10, and a middle layer comprising α-helices 4, 5 and 8.
See Figures 1 and 13. This region of the molecule is similar to nuclear
receptor ligand binding domains having known structure (Moras &
Gronemeyer, (1998) Curr. Opin. Cell Biol. 10: 384-91; Weatherman et al.,
(1999) Annu. Rev. Biochem. 68: 559-81). The standard three-stranded
β-sheet is expanded to a five-stranded antiparallel β-sheet in the human PXR
LBD, comprising β-strands 1, 1', 2, 3 and 4, and the ligand binding cavity is
localized at the bottom of the molecule. This structural feature is shown in
Figure 1. The structures of the apo and ligand-bound forms of the hPXR LBD
are essentially identical, exhibiting a root-mean-square deviation (RMSD) of
0.68 Å over the Cα positions, and 0.89 Å over all atoms. In both the apo and
SR12813-bound structures, the activation function 2 helix (α-AF2), which
plays a critical role in transcriptional activation by nuclear receptors, is packed
against the body of the receptor in a position that appears permissive for
coactivator interactions.
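
For readers unfamiliar with the RMSD figures quoted above, the following Python sketch shows how a root-mean-square deviation is computed over paired atom positions. It is a minimal illustration that assumes the two coordinate sets have already been superposed and are listed in matching atom order; it does not perform the superposition itself. The function name and the toy coordinates are hypothetical and are not taken from Table 4.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equally sized lists of (x, y, z)
    positions, assumed to be already superposed and in matching order."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


# Hypothetical toy coordinates in angstroms: every atom is displaced by
# exactly 1 angstrom along z between the two sets.
apo = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
bound = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0), (0.0, 1.0, 1.0)]

print(f"RMSD: {rmsd(apo, bound):.2f} angstroms")
```

Restricting the input lists to Cα atoms or passing all atoms corresponds to the two kinds of RMSD value reported in the text.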

The human PXR (SEQ ID NOs: 1 and 2) LBD (SEQ ID NOs: 3 and 4)
is most closely related in structure to the vitamin D receptor (VDR) (Rochel et
al., (2000) Mol. Cell 5: 173-79), with which it shares 45% sequence identity
(SEQ ID NOs: 8 and 9) and exhibits a 1.8 Å RMSD over 225 equivalent
Cα positions. Table 2 illustrates this similarity. More limited structural
similarity, between 2.4 and 2.9 Å RMSD, is shared with the ligand binding
domains of the retinoid X receptor (RXR), peroxisome proliferator-activated
receptor γ (PPARγ), and the estrogen and progesterone receptors (Bourguet et
al., (1995) Nature 375: 377-82; Renaud et al., (1995) Nature 378: 681-89;
Nolte et al., (1998) Nature 395: 137-43; Williams & Sigler, (1998) Nature 393:
392-96; Shiau et al., (1998) Cell 95: 927-37; Brzozowski et al., (1997) Nature
389: 753-58). This observation is also presented in Table 2.

VI.B. Novel Structural Features of the Human PXR LBD
While human PXR is similar to known nuclear receptor ligand binding
domains, it contains several distinct features that appear critical to its function
as a promiscuous xenobiotic receptor. First, the variable region typically
found between α1 and α3 in other nuclear receptors is replaced by a
four-residue turn in human PXR, as depicted in Figure 1. In the peroxisome
proliferator-activated receptors (PPARs), this region contains α2 and is the
proposed ligand access site for the binding cavity (Nolte et al., (1998) Nature
395: 137-43; Gampe et al., (2000) Mol. Cell 5: 545-55). This region cannot be
the access site in hPXR because it is occluded by the turn formed by residues
171-174. These structural features suggest that a distinct ligand entry site
exists in hPXR.

Second, helix α6, which is common to many nuclear receptors, is
replaced in hPXR by a conserved, flexible loop composed of residues
309-321 (See Figures 1 and 13). This region lies adjacent in space to the ligand
binding cavity of hPXR. This feature might be related to the ability of the
hPXR receptor to accommodate both small and large ligands in the binding
cavity.

Third, hPXR has two additional β-strands that have not previously been
observed in a nuclear receptor ligand binding domain (Weatherman et al.,
(1999) Annu. Rev. Biochem. 68: 559-81). This feature is depicted in Figure 1.
These additional strands form the fourth (β1, residues 210 to 217)
and fifth (β1', residues 221 to 226) strands of a five-stranded anti-parallel
β-sheet. An "insertion domain" containing roughly the same number of residues
was engineered out of the VDR prior to crystallization and structure
determination (Rochel et al., (2000) Mol. Cell 5: 173-79). However, small
angle x-ray scattering (Rochel et al., Eur. J. Biochem. 268: 971-79) suggests
that the 51-amino acid insertion domain in VDR (residues 164-216) is
unstructured; it shares only 12% sequence identity with the analogous
45-amino acid region (residues 184 to 229) in PXR. β1', at the base of this
five-stranded β-sheet, forms a novel homodimerization interaction in hPXR. As
discussed herein below, these structural features appear to play a role in
ligand binding, accessibility of the binding cavity, and the formation of
higher-order oligomers of hPXR.

VI.C. The Ligand Binding Cavity of Human PXR

The ligand binding cavity of the hPXR-LBD is largely hydrophobic and
might be flexible in nature, in order to accommodate both small and large
ligands. The 28 amino acid residues desolvated by the binding of SR12813,
which are presented in Table 3, are considered to be lining the cavity. The
structures of the apo and the ligand bound cavities are similar, exhibiting a
1.12 Å RMSD over all atoms in these 28 residues. Table 2 shows that the
binding cavity has a volume of 1,150 Å³, making it larger than most known
nuclear receptor ligand binding cavities. Twenty of the cavity-lining residues
are hydrophobic, four are polar (Ser-208, Ser-247, Cys-284 and Gln-285), and
four are charged or potentially charged (Glu-321, His-327, His-407 and
Arg-410). This observation is depicted via graphics in Figures 2 and 3 and is
additionally presented in Table 3. As described further herein below, Glu-321
and Arg-410 are involved in a salt bridge, effectively neutralizing their charged
character adjacent to the ligand binding cavity. Thus, an electrostatic view of
the inner surface of this ligand binding cavity reveals a relatively uncharged
and hydrophobic environment, as seen in Figures 2 and 3.

Spaced throughout this cavity are five critical polar residues which form
key binding interactions with SR12813 (Ser-208, Ser-247, Gln-285, His-407
and Arg-410, also indicated in Figures 2 and 3). These residues are capable
of hydrogen bonding and are spaced evenly throughout the upper portion of
the ligand binding cavity. The precise placement of polar residues is found to
be critical for defining the activation profile of mouse PXR with respect to
specific ligands.
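
As a minimal sketch of how an all-atom RMSD such as the value reported above for the 28 cavity-lining residues can be computed, the following Python fragment evaluates the root-mean-square deviation between two equally ordered sets of atomic coordinates. The function name rmsd and the toy coordinates are illustrative assumptions and are not taken from Table 4; the two structures are assumed to have been superimposed beforehand, and no fitting is performed.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equally ordered atom lists.

    Each argument is a list of (x, y, z) tuples in angstroms. The two
    structures are assumed to be superimposed already; no fitting is done.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


# Hypothetical toy coordinates (every atom shifted by 1 angstrom along z).
apo = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
bound = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0), (0.0, 1.0, 1.0)]
print(rmsd(apo, bound))
```

In practice the coordinate lists would be restricted to the atoms of the cavity-lining residues in the apo and ligand bound structures before the calculation is made.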

hPXR is able to bind both small and large ligands. The loop involving
residues 309-321 in hPXR (comprising the sequence EDTAGGFQQLLLE,
SEQ ID NO: 5) replaces α6 observed previously in reported nuclear receptor
ligand binding domain structures. This highly mobile region spans the space
between the C-terminus of β4 and the N-terminus of α7, and exhibits a mean
thermal displacement parameter of 82.3 Å² over main-chain atoms despite
persistent electron density. Nine of the thirteen residues in this loop are
completely conserved in the mammalian PXR molecules of known sequence
(highlighted in bold in SEQ ID NO: 5 above, and depicted in Figures 2 and 3),
including the solvent-exposed hydrophobic residues of SEQ ID NO: 5,
Phe-315, Leu-318, Leu-319 and Leu-320. The 309-321 loop is linked to the ligand
binding cavity of hPXR by a non-solvent accessible pore. This observation is
depicted in Figures 2 and 3. The binding of a particularly large and
hydrophobic ligand in the binding cavity (e.g. rifampicin) might force the pore
to this loop region to open, enlarging the ligand binding region of hPXR and
lining it with additional hydrophobic amino acid side chains.

VI.D. Three Orientations of SR12813 in the Ligand Binding Cavity

Three distinct binding modes of the high-affinity ligand SR12813
(positions 1, 2 and 3) were observed in the ligand binding cavity of hPXR,
shown in Figures 4-9. These orientations were identified during structural
refinement and were rigorously confirmed using difference maps involving
data obtained from crystals containing an iodinated form of SR12813
(I-SR12813 in Figures 4-9). As shown in Figures 4-9, each orientation forms
distinct interactions with residues that line the ligand binding cavity of PXR.
While ligand position 3 forms the most hydrophilic interactions with the
protein, positions 1 and 2 were clearly indicated in the detailed examination of
difference maps and in refinement. Indeed, when any one of the orientations
is not considered in an unbiased refinement, clear positive electron density
appears in difference maps to indicate its presence in the binding cavity of the
protein.

Of the nineteen residues involved in contacting these orientations of
SR12813, only Phe-288 interacts with all three ligand conformations. A
phenylalanine residue is conserved at position 288 in the known mammalian
PXR sequences. The remainder of the hydrophobic residues contact either
two orientations (Leu-209, Phe-251, Phe-281, Trp-299) or one orientation
(Leu-206, Val-211, Leu-240, Met-243, Met-246, Met-323, Leu-324, Phe-420)
of the ligand. Among the polar side chains, two (Ser-247, His-407) interact
with two orientations of the ligand, while the remaining four (Ser-208,
Cys-284, Gln-285, Arg-410) interact with only one orientation. All but five of
these residues are conserved by either identity or similarity in the mammalian
PXR sequences reported to date, an observation presented in Table 3.
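
The contact counts given in the preceding paragraph can be checked with a short tally. The following Python sketch records, for each residue named above, the number of SR12813 orientations it contacts, and then counts the residues in each class. The variable names are illustrative assumptions; the residue assignments are those stated in the text.

```python
from collections import Counter

# Number of SR12813 orientations contacted by each residue, as stated in the text.
CONTACTS = {
    "Phe-288": 3,
    # hydrophobic residues contacting two orientations
    "Leu-209": 2, "Phe-251": 2, "Phe-281": 2, "Trp-299": 2,
    # hydrophobic residues contacting one orientation
    "Leu-206": 1, "Val-211": 1, "Leu-240": 1, "Met-243": 1,
    "Met-246": 1, "Met-323": 1, "Leu-324": 1, "Phe-420": 1,
    # polar side chains contacting two orientations
    "Ser-247": 2, "His-407": 2,
    # polar side chains contacting one orientation
    "Ser-208": 1, "Cys-284": 1, "Gln-285": 1, "Arg-410": 1,
}

POLAR = {"Ser-247", "His-407", "Ser-208", "Cys-284", "Gln-285", "Arg-410"}

# Residues grouped by how many orientations they contact.
tally = Counter(CONTACTS.values())
hydrophobic_count = sum(1 for name in CONTACTS if name not in POLAR)

print(len(CONTACTS), dict(tally), hydrophobic_count)
```

The tally accounts for all nineteen residues: one contacts all three orientations, six contact two, and twelve contact one, with thirteen hydrophobic and six polar side chains.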

VI.D.1. The Ligand Binding Promiscuity of PXR

The nuclear receptor PXR serves as a key component of the body's
defense mechanism against xenobiotics by detecting these compounds and
regulating the transcription of genes involved in their metabolism (Waxman,
(1999) Arch. Biochem. Biophys. 369: 11-23; Savas et al., (1999) Mol.
Pharmacol. 56: 851-57). Unlike other nuclear receptors, which evolved to
interact selectively with their cognate hormone, PXR evolved the ability to
interact promiscuously with a structurally diverse collection of hydrophobic
compounds. The x-ray crystal structures of the apo and SR12813-bound
PXR-LBD reveal several important features underlying PXR's promiscuous
ligand binding properties.

First, the volume of the PXR ligand binding cavity is >1,100 Å³, which is
substantially larger than that of many other nuclear receptors including the
progesterone, estrogen, retinoid, and thyroid hormone receptors, an
observation represented in Table 2. The PXR structure disclosed in the
present invention also suggests that the ligand binding cavity may be capable
of expanding to an even larger size. A stretch of conserved residues, 309-321
(identified in Figures 1-3), loops out and away from the ligand binding cavity of
PXR in the structures reported here, exposing conserved hydrophobic
residues to solvent. This loop is connected to the existing ligand binding
cavity by a solvent-inaccessible pore. These observations suggest that PXR
might rearrange its ligand binding cavity when confronted by larger ligands,
such as rifampicin. If the pore were to open between the ligand binding cavity
and the 309-321 loop, the conserved hydrophobic residues currently exposed
to solvent might be used to line a larger hydrophobic binding cavity. Thus,
structural flexibility in the ligand binding cavity can contribute to PXR's ability
to bind both to small hydrophobic ligands and to the much larger
compound rifampicin.

Second, the position of the ligand entry site appears to be unique in
PXR. Instead of being near the AF2-helix as in the steroid, retinoid, and
thyroid hormone receptors or in the front of the molecule as in the PPARs, the
entry site appears to be located on the back of the molecule between α7 and
α10. The putative entrance-exit path to the ligand binding cavity in hPXR
might be gated by two salt bridges (i.e. between Asp-205 and Arg-413, and
Glu-321 and Arg-410), as shown in Figure 10. Mutagenesis studies disclosed
in the present invention show that Arg-410 and Asp-205, which are in van der
Waals contact with each other, are critical to defining the appropriate basal
activity level for PXR. When Asp-205 is mutated to alanine, the basal activity
level of PXR drops dramatically in cell-based reporter gene assays. In
contrast, changing Arg-410 to alanine increases the basal activity level of
PXR, as indicated in Figure 11. The interactions of Glu-321 and Arg-413
with Arg-410 and Asp-205, respectively, might be necessary simply to
define the locations of Asp-205 and Arg-410; the correct positioning of these
two residues appears to tune the transcriptional activity of PXR. Perhaps
during the opening and closing of this gate to allow for the movement of
ligands, a vital intermediate is formed by the interaction of Arg-410 with
Asp-205. The drop in basal activity level in the Asp-205-Ala mutant might also
indicate that this form of the protein is less stable than the wildtype. Asp-205
could be critical for ordering the 198-210 region of the molecule; the loss of
this residue could introduce flexibility to PXR that affects its stability and basal
activation level.

Third, the ligand binding cavity of PXR is relatively smooth and
hydrophobic, but contains a small number of key polar residues. Twenty of
the 28 residues lining the binding cavity are hydrophobic; the remaining polar
residues are spaced roughly evenly throughout the cavity, offering the
potential of a small number of hydrogen bonds. The character of the PXR
ligand cavity mirrors the character of the majority of the ligands known to
activate PXR, such as phenobarbital, clotrimazole, RU486, hyperforin, PCN
and lovastatin, which are largely hydrophobic and uncharged, but which
contain one to four functional groups capable of forming hydrogen bonds.

A combination of hydrophobic and shuffled polar interactions allows the
potent PXR agonist SR12813 to bind to the hPXR ligand binding cavity in
three distinct orientations. Although SR12813 occupies essentially the same
portion of the cavity in each of the three binding modes, each mode is
facilitated by a different set of hydrogen bonding and hydrophobic interactions
(Figures 4-6). For example, Ser-247 forms a hydrogen bond with the phenolic
hydroxyl group on SR12813 in one binding mode, and a hydrogen bond with a
phosphate group in another. Thus, PXR does not define a specific ligand
binding surface that will drive the binding of ligands in one mode. Instead,
PXR offers an essentially smooth and uncharged ligand binding surface, with
evenly spaced hydrogen bond donors and acceptors (Figures 2 and 3). Such
an arrangement not only permits PXR to bind to structurally distinct ligands,
but also allows single ligands to bind in multiple modes. This binding mode
stands in sharp contrast to other nuclear receptor-ligand interactions, which
have evolved to be highly specific.

VI.E. Electrostatic Interactions Adjacent to the Ligand Binding Cavity

Two salt bridges occur across the region of the ligand binding cavity
that is closest to the surface of hPXR and are depicted in Figure 10: Arg-410
with Glu-321, and Arg-413 with Asp-205. Asp-205, Glu-321 and Arg-413 are
almost completely conserved in the PXR molecules of known sequence;
rabbit PXR contains a glutamic acid at position 205. Arg-410, however, is
less conserved; in human and rabbit PXR, this residue is an arginine or lysine,
while in rat and mouse it is a glutamine.

To examine the functional relevance of the Asp-205/Arg-413 and
Glu-321/Arg-410 salt bridges, each of these residues was individually mutated to
alanine. Expression plasmids for each of the four mutant PXRs were
co-transfected into CV-1 cells together with the XREM-CYP3A4-LUC reporter
plasmid, which contains the distal enhancer and proximal promoter of CYP3A4,
including two high-affinity PXR binding sites, driving expression of the
luciferase gene (Goodwin et al., (1999) Mol. Pharmacol. 56: 1329-39).
Transfected cells were treated with increasing concentrations of either
SR12813 or rifampicin, as illustrated in the plots of Figure 11. Mutation of
Asp-205 resulted in a marked decrease in the basal (ligand-independent)
transcriptional activity of PXR, while mutation of Arg-413 resulted in a more
modest reduction in PXR basal activity. In the other salt bridge, mutation of
Glu-321 resulted in a marked increase in the basal activity of PXR;
conversely, the Arg-410-Ala mutant had reduced basal activity. Taken
together, these data demonstrate that the residues that form these salt
bridges are important in determining the basal activity of the PXR. Moreover,
the data highlight the asymmetry in the functions of the residues comprising
the salt bridges: mutation of either Arg-410 or Asp-205 has a more dramatic
effect than alteration of their salt bridge partners, suggesting that these two
residues are particularly critical for determining the basal transcriptional
activity of PXR.

The mutant PXR LBD polypeptides Asp-205-Ala, Arg-413-Ala and
Glu-321-Ala were all activated by both SR12813 and rifampicin, although the
dynamic range of the response was compressed relative to the wildtype
receptor for the Glu-321-Ala mutant. Interestingly, rifampicin was a more
potent activator of the Asp-205-Ala mutant than wildtype PXR. This effect
was not seen with SR12813, suggesting that SR12813 and rifampicin bind to
PXR in different ways. These results also indicate that mutation of residues
involved in salt bridges can affect the potency and selectivity with which
compounds activate PXR.


VI.F. hPXR Forms a Novel Homodimer

A remarkable observation arising from these structural studies was that
the ligand binding domain of hPXR forms a homodimer through the β-sheet
structure that is unique to PXR. β1′ from the β1-β1′ region of this structure
forms an ideal anti-parallel β-sheet interaction with β1′ from a
crystallographically-related monomer, as shown in Figure 12. The dimer
interface appears to be stabilized in two ways. First, six 2.9-3.0 Å main-chain
to main-chain hydrogen bonds in an anti-parallel β-sheet pattern occur across
the interface. This generates a ten-stranded anti-parallel β-sheet between the
two monomers, highlighted in Figure 13. Second, two aromatic side chains
are buried, which lends a large degree of specificity to this
interaction. Notably, the Trp-223 side chains from each monomer are locked
across the dimer interface, and thus are completely protected from solvent,
permitting them to form an "offset-edge" stacking interaction frequently
observed between tryptophans (Samanta et al., (1999) Acta Crystallogr. D 55:
1421-27). Tyr-225 is also involved in this interface and becomes buried in the
dimer, packing against Pro-175 from the region leading up to residue 177.
The position of Trp-223 is further stabilized by a 3.0 Å hydrogen bond
between the main-chain carbonyl oxygen of Pro-175 and the indole ring
nitrogen of the tryptophan side chain. In all, this highly specific dimer
interface buries about 1,600 Å² of solvent accessible surface area on each
monomer. Trp-223, Tyr-225 and Pro-175 are conserved in all the reported
mammalian PXR sequences. The evolutionary conservation of the residues
involved in this interaction supports its role in the biological function of PXR.

To assess whether this homodimerization interface is functionally
relevant in vivo, reporter gene experiments were performed with a PXR
double mutant in which Trp-223 and Tyr-225 were mutated to alanine. The
Trp-223-Ala/Tyr-225-Ala mutant hPXR polypeptide had dramatically reduced
basal activity and was only very weakly responsive to either SR12813 or
rifampicin, as shown in Figure 14. Gel mobility shift assays revealed that this
double mutant retains the ability to bind to DNA as a heterodimer with RXRα.
Taken together, these data suggest an unprecedented role for a novel
homodimerization interface in PXR-mediated transcriptional activation.

A model for a putative heterotetrameric complex formed by the ligand
binding domains of hPXR and RXRα was generated by superimposing PXR
on PPARγ in the heterodimeric structure of PPARγ and RXRα (Gampe et al.,
(2000) Mol. Cell 5: 545-55). The interface between PXR and RXRα generated
in this fashion is nearly ideal, requiring no manual optimization and producing
no van der Waals overlaps between atoms. Importantly, the
homodimerization of PXR does not interfere with the PXR/RXRα interface in
the heterotetramer, suggesting that a heterotetramer of PXR and RXRα
could occur in vivo.

VI.G. Structure-Based Receptor Design: Altering the Ligand
Specificity of Mouse PXR By Four Targeted Site-Directed Mutants

Marked differences in the pharmacological activation profiles of PXR
occur across species. For example, human and rabbit PXR are activated
efficiently by SR12813 whereas mouse PXR is not. Conversely, mouse PXR
is activated more efficiently by the synthetic steroid PCN than either the
human or rabbit orthologs (Jones et al., (2000) Mol. Endocrinol. 14: 27-39;
Savas et al., (2000) Drug Metab. Dispos. 28: 529-37). To verify the
importance of polar residues in the selective PXR-SR12813 interaction,
mouse PXR was "humanized" - that is, its selectivity was altered such that it
would respond to SR12813 but not to PCN.

Four residues that interact with SR12813 in the hPXR crystal structure
and differ between human and mouse PXR were chosen for mutagenesis:
Leu-206/Arg-203, Ser-208/Pro-205, His-407/Gln-404, Arg-410/Gln-407
(human residues in each pair are listed first). Each of these residues was
mutated in the context of a mouse PXR expression plasmid to the
corresponding amino acid in the hPXR (R203L, P205S, Q404H, Q407R; using
the mouse numbering scheme). As expected, wildtype mouse PXR
responded to PCN and was only weakly activated by SR12813 in reporter
assays (Figure 15). However, the R203L, P205S, Q404H, Q407R mutant
(mouse->human PXR) was no longer activated by PCN but was activated
efficiently by SR12813, mirroring the hPXR activation profile, as shown by the
bar graphs presented in Figure 15. Thus, mouse PXR's response to
particular ligands was "humanized" by employing the crystal structure to
design targeted mutants. These results show that the selectivity of PXR for
certain ligands resides in the number and position of only a few polar residues
within the molecule's large, hydrophobic ligand binding cavity.
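
The relationship between the human/mouse residue pairs and the mutation labels given above can be illustrated with a short sketch. The following Python fragment derives the four substitution labels, in the mouse numbering scheme, from the residue pairs stated in the text. The function and variable names are illustrative assumptions and form no part of the disclosed methods.

```python
# (human residue, human position, mouse residue, mouse position), from the text.
PAIRS = [
    ("Leu", 206, "Arg", 203),
    ("Ser", 208, "Pro", 205),
    ("His", 407, "Gln", 404),
    ("Arg", 410, "Gln", 407),
]

# Standard one-letter codes for the amino acids involved.
ONE_LETTER = {"Leu": "L", "Arg": "R", "Ser": "S", "Pro": "P", "His": "H", "Gln": "Q"}


def mouse_to_human_label(human_res, mouse_res, mouse_pos):
    """Label for mutating a mouse residue to its human counterpart."""
    return f"{ONE_LETTER[mouse_res]}{mouse_pos}{ONE_LETTER[human_res]}"


labels = [mouse_to_human_label(h, m, mp) for h, _hp, m, mp in PAIRS]

# Offset between human and mouse numbering at each of the four positions.
offsets = {hp - mp for _h, hp, _m, mp in PAIRS}

print(labels, offsets)
```

At each of the four positions the human residue number exceeds the mouse residue number by three, which is consistent with the pairs listed above.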

VI.G.1. Species-Specific Activation by Ligand in the PXRs

PXR is promiscuous in that it binds a wide variety of endogenous
compounds and xenobiotics. However, this receptor also exhibits specificity
as demonstrated by the fact that PXRs from different species show distinct
activation profiles. To test the hypothesis that key polar residues determine
specificity, the crystal structures of hPXR disclosed in the present invention
were employed to design an altered form of PXR. Mutation of only four
residues in the mouse PXR ligand binding cavity (Arg-203, Pro-205, Gln-404,
Gln-407) to the corresponding residues in the human receptor (Leu-206,
Ser-208, His-407, Arg-410, respectively) results in a mouse->human PXR that is
activated efficiently by SR12813 but not by PCN, which is a rodent-selective
PXR ligand (Figure 15). Thus, a "humanized" mouse PXR was created by
altering four polar amino acids. This indicates that the relative positions of
critical polar residues are sufficient for achieving selectivity in this receptor.
Promiscuity appears to arise from the general hydrophobic nature and
relatively small number of polar groups within the cavity.

VI.H. Generation of Easily-Solved PXR Crystals

The present invention discloses a substantially pure PXR LBD
polypeptide in crystalline form. In a preferred embodiment, exemplified in the
Figures and Laboratory Examples, PXR is crystallized with bound ligand.
Crystals are formed from PXR LBD polypeptides that are usually expressed
by a cell culture, such as E. coli. Bromo- and iodo-substitutions can be
included during the preparation of crystal forms and can act as heavy atom
substitutions in PXR ligands and in crystals of PXR and the PXR LBD. This
method can be advantageous for the phasing of the crystal, which is a crucial,
and sometimes limiting, step in solving the three-dimensional structure of a
crystallized entity. Thus, the need for generating the heavy metal derivatives
traditionally employed in crystallography might be eliminated. After the
three-dimensional structure of a PXR or PXR LBD with or without a ligand bound is
determined, the resultant three-dimensional structure can be used in
computational methods to design synthetic ligands for PXR and other PXR
polypeptide fragments. Further activity structure relationships can be
determined through routine testing, using assays disclosed herein and known
in the art.

VII. Uses of PXR Crystals and the Three-Dimensional Structure of the
Ligand Binding Domain of PXR

VII.A. Design and Development of PXR Modulators

The knowledge of the structure of the hPXR ligand binding domain, an 
aspect of the present invention, provides a tool for investigating the 
mechanism of action of hPXR and other PXR polypeptides in a subject. For 
example, various computer models, as described herein, can predict the 
binding of various substrate molecules to the LBD of hPXR. Upon discovering 
that such binding in fact takes place, knowledge of the protein structure then
allows design and synthesis of small molecules that mimic the functional
binding of the substrate to the LBD of hPXR. This is the method of "rational" 
drug design, further described herein. 

Use of the isolated and purified hPXR crystalline structure of the
present invention in rational drug design is thus provided in accordance with
the present invention. Additional rational drug design techniques are 
described in U.S. Patent Nos. 5,834,228 and 5,872,011, incorporated herein 
in their entirety. 

Thus, in addition to the compounds described herein, other sterically
similar compounds can be formulated to mimic the key structural regions of
PXRs in general, or of hPXR in particular. The generation of a structural
functional equivalent can be achieved by the techniques of modeling and
chemical design known to those of skill in the art and described herein. It will
be understood that all such sterically similar constructs fall within the scope of
the present invention.



VII.A.1. Rational Drug Design 

The three-dimensional structure of ligand binding hPXR is
unprecedented and will greatly aid in the development of new synthetic
ligands for a PXR polypeptide, such as PXR agonists and antagonists,
including those that bind exclusively to any one of the PXR orthologs. In
addition, PXR is well suited to modern methods, including three-dimensional
structure elucidation and combinatorial chemistry, such as those disclosed in
U.S. Patent No. 5,463,564, incorporated herein by reference. Structure
determination using X-ray crystallography is possible because of the solubility
properties of the PXR orthologs. Computer programs that use crystallography
data when practicing the present invention will enable the rational design of
ligands to these receptors. Programs such as RASMOL (Biomolecular
Structures Group, Glaxo Wellcome Research & Development, Stevenage,
Hertfordshire, UK; Version 2.6, August 1995; Version 2.6.4, December 1998;
Copyright © Roger Sayle 1992-1999) can be used with the atomic structural
coordinates from crystals of the present invention, crystals generated by
practicing the invention or crystals used to practice the invention by
generating three-dimensional models and/or determining the structures
involved in ligand binding. Computer programs such as those sold under the
registered trademark INSIGHT II® and such as GRASP (Nicholls et al., (1991)
Proteins 11: 281-96) allow for further manipulations and the ability to
introduce new structures. In addition, high throughput binding and bioactivity
assays can be devised using purified recombinant protein and modern
reporter gene transcription assays known to those of skill in the art in order to
refine the activity of a designed ligand.

A method of identifying modulators of the activity of a PXR polypeptide
using rational drug design is thus provided in accordance with the present
invention. The method comprises designing a potential modulator for a PXR
polypeptide of the present invention that will form non-covalent bonds with
amino acids in the ligand binding cavity based upon the crystalline structure of
the hPXR LBD polypeptide; synthesizing the modulator; and determining
whether the potential modulator modulates the activity of the PXR
polypeptide. In a preferred embodiment, the modulator is designed for a
hPXR polypeptide. Preferably, the hPXR polypeptide comprises the amino
acid sequence of SEQ ID NO: 2, and the hPXR LBD comprises the amino
acid sequence SEQ ID NO: 4. The determination of whether the modulator
modulates the biological activity of a PXR polypeptide is made in accordance
with the screening methods disclosed herein, or by other screening methods
known to those of skill in the art. Modulators can be synthesized using
techniques known to those of ordinary skill in the art.

In an alternative embodiment, a method of designing a modulator of a
PXR polypeptide in accordance with the present invention is disclosed
comprising: (a) selecting a candidate PXR ligand; (b) determining which
amino acid or amino acids of a PXR polypeptide interact with the ligand using
a three-dimensional model of a crystallized hPXR LBD; (c) identifying in a
biological assay for PXR activity a degree to which the ligand modulates the
activity of the PXR polypeptide; (d) selecting a chemical modification of the
ligand wherein the interaction between the amino acids of the PXR
polypeptide and the ligand is predicted to be modulated by the chemical
modification; (e) performing the chemical modification on the ligand to form a
modified ligand; (f) contacting the modified ligand with the PXR polypeptide;
(g) identifying in a biological assay for PXR activity a degree to which the
modified ligand modulates the biological activity of the PXR polypeptide; and
(h) comparing the biological activity of the PXR polypeptide in the presence of
modified ligand with the biological activity of the PXR polypeptide in the
presence of the unmodified ligand, whereby a modulator of a PXR polypeptide
is designed.
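
By way of illustration only, the comparison recited in step (h) above can be sketched as a fold-change calculation on two assay readouts. The following Python fragment is a minimal sketch under stated assumptions: the function names, the 20% tolerance and the numerical readouts are hypothetical and are not taken from the assays disclosed herein.

```python
def fold_change(activity_modified, activity_unmodified):
    """Ratio of reporter activity with the modified ligand to that with the
    unmodified ligand. Both readouts must be in the same (arbitrary) units."""
    if activity_unmodified <= 0:
        raise ValueError("unmodified-ligand activity must be positive")
    return activity_modified / activity_unmodified


def classify(fc, tolerance=0.2):
    """Label the effect of a chemical modification.

    The tolerance is an assumed cutoff for what counts as a change; a real
    assay would set it from replicate variability.
    """
    if fc > 1 + tolerance:
        return "increased"
    if fc < 1 - tolerance:
        return "decreased"
    return "unchanged"


# Hypothetical readouts (arbitrary luciferase units).
print(classify(fold_change(300.0, 100.0)))
print(classify(fold_change(50.0, 100.0)))
print(classify(fold_change(105.0, 100.0)))
```

A modification whose readout falls in the "increased" or "decreased" class would, under these assumptions, be carried forward as a candidate for a further round of steps (d) through (h).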

VII.A.2. Methods for Using the hPXR LBD Structural Coordinates
For Molecular Design

For the first time, the present invention permits the use of molecular
design techniques to design, select and synthesize chemical entities and
compounds, including modulatory compounds, capable of binding to the
ligand binding cavity or an accessory binding site of hPXR and the hPXR
LBD, in whole or in part. Correspondingly, the present invention also provides
for the application of similar techniques in the design of modulators of any
PXR polypeptide.

In accordance with a preferred embodiment of the present invention,
the structure coordinates of a crystalline hPXR LBD can be used to design
compounds that bind to a PXR LBD (more preferably a hPXR LBD) and alter
the properties of a PXR LBD (for example, the dimerization or ligand binding
ability) in different ways. One aspect of the present invention provides for the
design of compounds that act as competitive inhibitors of a PXR polypeptide
by binding to all, or a portion of, the binding sites on a PXR LBD. The present
invention also provides for the design of compounds that can act as
uncompetitive inhibitors of a PXR LBD. These compounds can bind to all, or
a portion of, an accessory binding site of a PXR that is already binding its
ligand and can, therefore, be more potent and less non-specific than known
competitive inhibitors that compete only for the PXR ligand binding cavity.
Similarly, non-competitive inhibitors that bind to and inhibit PXR LBD activity,
whether or not it is bound to another chemical entity, can be designed using
the PXR LBD structure coordinates of this invention.

A second design approach is to probe a PXR or PXR LBD (preferably a
hPXR or hPXR LBD) crystal with molecules comprising a variety of different
chemical entities to determine optimal sites for interaction between candidate
PXR or PXR LBD modulators and the polypeptide. For example, high
resolution X-ray diffraction data collected from crystals saturated with solvent
allows the determination of the site where each type of solvent molecule
adheres. Small molecules that bind tightly to those sites can then be
designed, synthesized and tested for their hPXR modulator activity.

Once a computationally-designed ligand is synthesized using the
methods of the present invention or other methods known to those of skill in
the art, assays can be used to establish the efficacy of the ligand as a
modulator of PXR (preferably hPXR) activity. After such assays, the ligands
can be further refined by generating intact PXR, or PXR LBD, crystals with a
ligand bound to the PXR. The structure of the ligand can then be further
refined using the chemical modification methods described herein and known
to those of skill in the art, in order to improve the modulation activity or the
binding affinity of the ligand. This process can lead to second generation
ligands with improved properties.

Ligands also can be selected that modulate PXR responsive gene
transcription by the method of altering the interaction of co-activators and
co-repressors with their cognate PXR. For example, agonistic ligands can be
selected that block or dissociate a co-repressor from interacting with the PXR,
and/or that promote binding or association of a co-activator. Antagonistic
ligands can be selected that block co-activator interaction and/or promote
co-repressor interaction with a target receptor. Selection can be done via binding
assays that screen for designed ligands having the desired modulatory
properties. Preferably, interactions of a hPXR polypeptide are targeted.
Suitable assays for screening that can be employed, mutatis mutandis, in the
present invention, are described in published PCT international applications
WO 00/037077 and WO 00/025134, incorporated herein by reference in their
entirety.

VII.A.3. Methods of Designing PXR LBD Modulator Compounds

The design of candidate substances, also referred to as "compounds"
or "candidate compounds", that bind to or inhibit PXR LBD-mediated activity
according to the present invention generally involves consideration of two
factors. First, the compound must be capable of physically and structurally
associating with a PXR LBD. Non-covalent molecular interactions important
in the association of a PXR LBD with its substrate include hydrogen bonding,
van der Waals interactions and hydrophobic interactions.

Second, the compound must be able to assume a conformation that
allows it to associate with a PXR LBD. Although certain portions of the
compound will not directly participate in this association with a PXR LBD,
those portions can still influence the overall conformation of the molecule.
This, in turn, can have a significant impact on potency. Such conformational
requirements include the overall three-dimensional structure and orientation of
the chemical entity or compound in relation to all or a portion of the binding
site, e.g., the ligand binding cavity or an accessory binding site of a PXR LBD,
or the spacing between functional groups of a compound comprising several
chemical entities that directly interact with a PXR LBD.

The potential modulatory or binding effect of a chemical compound on 
a PXR LBD can be analyzed prior to its actual synthesis and testing by the 
use of computer modeling techniques that employ the coordinates of a
crystalline hPXR LBD polypeptide of the present invention. If the theoretical
structure of the given compound suggests insufficient interaction and
association between it and a PXR LBD, synthesis and testing of the
compound are obviated. However, if computer modeling indicates a strong
interaction, the molecule can then be synthesized and tested for its ability to
bind and modulate the activity of a PXR LBD. In this manner, synthesis of
unproductive or inoperative compounds can be avoided.

A modulatory or other binding compound of a PXR LBD polypeptide
(preferably a hPXR LBD) can be computationally evaluated and designed via
a series of steps in which chemical entities or fragments are screened and
selected for their ability to associate with the individual binding sites or other
areas of a crystalline hPXR LBD polypeptide of the present invention.

One of several methods can be used to screen chemical entities or 
fragments for their ability to associate with a PXR LBD and, more particularly, 
with the individual binding sites of a PXR LBD, such as the ligand binding cavity
or an accessory binding site. This process can begin by visual inspection of,
for example, the ligand binding cavity on a computer screen based on the
hPXR LBD atomic coordinates in Table 4. Selected fragments or chemical
entities can then be positioned in a variety of orientations, or docked, within
an individual binding site of a hPXR LBD as defined herein above. Docking
can be accomplished using software programs such as those available under
the tradenames QUANTA™ (Molecular Simulations Inc., San Diego,
California) and SYBYL™ (Tripos, Inc., St. Louis, Missouri), followed by energy
minimization and molecular dynamics with standard molecular mechanics
forcefields, such as CHARMM (Brooks et al., (1983) J. Comp. Chem., 8: 132)
and AMBER 5 (Case et al., (1997), AMBER 5, University of California, San
Francisco; Pearlman et al., (1995) Comput. Phys. Commun. 91: 1-41).

Specialized computer programs can also assist in the process of 
selecting fragments or chemical entities. These include: 

1. GRID™ program, version 17 (Goodford, (1985) J. Med. Chem. 28:
849-57), which is available from Molecular Discovery Ltd., Oxford, UK;

2. MCSS™ program (Miranker & Karplus, (1991) Proteins 11: 29-34),
which is available from Molecular Simulations, Inc., San Diego, California;

3. AUTODOCK™ 3.0 program (Goodsell & Olsen, (1990) Proteins 8:
195-202), which is available from the Scripps Research Institute, La Jolla,
California;

4. DOCK™ 4.0 program (Kuntz et al., (1992) J. Mol. Biol. 161: 269-88),
which is available from the University of California, San Francisco,
California;

5. FLEX-X™ program (See, Rarey et al., (1996) J. Comput. Aid. Mol.
Des. 10: 41-54), which is available from Tripos, Inc., St. Louis, Missouri;

6. MVP program (Lambert, (1997) in Practical Application of
Computer-Aided Drug Design, (Charifson, ed.) Marcel-Dekker, New York, pp.
243-303); and

7. LUDI™ program (Bohm, (1992) J. Comput. Aid. Mol. Des., 6: 61-78),
which is available from Molecular Simulations, Inc., San Diego, California.

Once suitable chemical entities or fragments have been selected, they 
can be assembled into a single compound or modulator. Assembly can
proceed by visual inspection of the relationship of the fragments to each other
on the three-dimensional image displayed on a computer screen in relation to
the structure coordinates of a hPXR LBD. Manual model building using
software such as QUANTA™ or SYBYL™ typically follows.

Useful programs to aid one of ordinary skill in the art in connecting the
individual chemical entities or fragments include:

1. CAVEAT™ program (Bartlett et al., (1989) Special Pub., Royal
Chem. Soc. 78: 182-96), which is available from the University of California,
Berkeley, California;

2. 3D Database systems, such as MACCS-3D™ system program,
which is available from MDL Information Systems, San Leandro, California.
This area is reviewed in Martin, (1992) J. Med. Chem. 35: 2145-54; and

3. HOOK™ program (Eisen et al., (1994) Proteins 19: 199-221),
which is available from Molecular Simulations, Inc., San Diego, California.

Instead of proceeding to build a PXR LBD modulator (preferably a
hPXR LBD modulator) in a step-wise fashion one fragment or chemical entity
at a time as described above, modulatory or other binding compounds can be
designed as a whole or de novo using the structural coordinates of a
crystalline hPXR LBD polypeptide of the present invention and either an
empty binding site or optionally including some portion(s) of a known
modulator(s). Applicable methods can employ the following software
programs:

1. LUDI™ program (Bohm, (1992) J. Comput. Aid. Mol. Des., 6: 61-78),
which is available from Molecular Simulations, Inc., San Diego, California;

2. LEGEND™ program (Nishibata & Itai, (1991) Tetrahedron 47:
8985); and

3. LEAPFROG™, which is available from Tripos Associates, St. Louis,
Missouri.

Other molecular modeling techniques can also be employed in 
accordance with this invention. See, e.g., Cohen et al., (1990) J. Med. Chem.
33: 883-94. See also, Navia & Murcko, (1992) Curr. Opin. Struc. Biol. 2: 202-10;
U.S. Patent No. 6,008,033, herein incorporated by reference.

Once a compound has been designed or selected by the above
methods, the efficiency with which that compound can bind to a PXR LBD can
be tested and optimized by computational evaluation. By way of particular
example, a compound that has been designed or selected to function as a
hPXR LBD modulator should also preferably traverse a volume not
overlapping that occupied by the binding site when it is bound to its native
ligand. Additionally, an effective PXR LBD modulator should preferably
demonstrate a relatively small difference in energy between its bound and
free states (i.e., a small deformation energy of binding). Thus, the most
efficient PXR LBD modulators should preferably be designed with a
deformation energy of binding of not greater than about 10 kcal/mole, and
preferably, not greater than 7 kcal/mole. It is possible for PXR LBD
modulators to interact with the polypeptide in more than one conformation,
each similar in overall binding energy. In those cases, the deformation energy of
binding is taken to be the difference between the energy of the free compound
and the average energy of the conformations observed when the modulator
binds to the polypeptide.
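
The energy criterion described above can be sketched as follows. This is a minimal illustration only: the function names, the classification labels and the input energies are assumptions made for the example, and in practice the energies would be obtained from a molecular mechanics or quantum chemistry program.

```python
from statistics import mean

# Thresholds taken from the text (kcal/mole).
MAX_DEFORMATION = 10.0
PREFERRED_DEFORMATION = 7.0


def deformation_energy(free_energy, bound_energies):
    """Return the deformation energy of binding in kcal/mole.

    free_energy: energy of the free (unbound) compound.
    bound_energies: energies of the compound in each bound conformation;
    when several conformations are observed, their average is used.
    """
    if not bound_energies:
        raise ValueError("at least one bound conformation is required")
    return mean(bound_energies) - free_energy


def classify(free_energy, bound_energies):
    """Label a compound against the thresholds stated in the text."""
    delta = deformation_energy(free_energy, bound_energies)
    if delta <= PREFERRED_DEFORMATION:
        return "preferred"
    if delta <= MAX_DEFORMATION:
        return "acceptable"
    return "rejected"


# Hypothetical energies (kcal/mole), for illustration only.
print(classify(-42.0, [-37.5, -36.5]))
```

Averaging over the bound conformations mirrors the convention stated above for modulators that bind in more than one conformation.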

A compound designed or selected as binding to a PXR polypeptide
(preferably a hPXR LBD polypeptide) can be further computationally
optimized so that in its bound state it would preferably lack repulsive
electrostatic interaction with the target polypeptide. Such non-complementary
(e.g., electrostatic) interactions include repulsive charge-charge, dipole-dipole
and charge-dipole interactions. Specifically, the sum of all electrostatic
interactions between the modulator and the polypeptide when the modulator
is bound to a PXR LBD preferably makes a neutral or favorable contribution to
the enthalpy of binding.
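
As a rough sketch of the electrostatic criterion above, the pairwise interactions can be summed with Coulomb's law over point charges. The atom representation, the function names, the uniform dielectric constant and the example charges are all assumptions of this illustration; a force field package would normally supply the charges and a more careful treatment of solvent screening.

```python
import math

# Coulomb constant in kcal*Angstrom/(mol*e^2), as commonly used in molecular mechanics.
COULOMB_KCAL = 332.06


def electrostatic_sum(ligand_atoms, protein_atoms, dielectric=4.0):
    """Sum pairwise Coulomb energies (kcal/mole) between two sets of atoms.

    Each atom is a tuple (partial_charge, x, y, z) with coordinates in Angstroms.
    The uniform dielectric constant is an assumption of this sketch.
    """
    total = 0.0
    for q_l, *pos_l in ligand_atoms:
        for q_p, *pos_p in protein_atoms:
            distance = math.dist(pos_l, pos_p)
            total += COULOMB_KCAL * q_l * q_p / (dielectric * distance)
    return total


def is_electrostatically_acceptable(ligand_atoms, protein_atoms):
    """True when the summed interaction is neutral or favorable (<= 0)."""
    return electrostatic_sum(ligand_atoms, protein_atoms) <= 0.0


# Hypothetical partial charges and coordinates, for illustration only.
ligand = [(-0.5, 0.0, 0.0, 0.0), (0.2, 1.5, 0.0, 0.0)]
protein = [(0.4, 0.0, 3.0, 0.0), (-0.3, 4.5, 0.0, 0.0)]
print(is_electrostatically_acceptable(ligand, protein))
```

A negative sum corresponds to a net attractive interaction; a positive sum would flag the repulsive case that the text advises against.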

Specific computer software is available in the art to evaluate compound
deformation energy and electrostatic interaction. Examples of programs
designed for such uses include:

1. Gaussian 98™, which is available from Gaussian, Inc., Pittsburgh,
Pennsylvania;

2. AMBER™ program, version 6.0, which is available from the
University of California at San Francisco;

3. QUANTA™ program, which is available from Molecular Simulations,
Inc., San Diego, California;

4. CHARMm® program, which is available from Molecular Simulations,
Inc., San Diego, California; and

5. INSIGHT II® program, which is available from Molecular
Simulations, Inc., San Diego, California.

These programs can be implemented using a suitable computer 
system. Other hardware systems and software packages will be apparent to
those skilled in the art after review of the disclosure of the present invention
presented herein. 

Once a PXR LBD modulating compound has been optimally selected 
or designed, as described above, substitutions can then be made in some of 
its atoms or side groups in order to improve or modify its binding properties.
Generally, initial substitutions are conservative, i.e., the replacement group
will have approximately the same size, shape, hydrophobicity and charge as
the original group. It should, of course, be understood that components
known in the art to alter conformation should be avoided. Such substituted
chemical compounds can then be analyzed for efficiency of fit to a PXR LBD
binding site using the same computer-based approaches described in detail
above.

VII.B. Distinguishing Between PXR Isoforms and Orthologs

The present invention discloses the ability to generate new synthetic
ligands to distinguish between PXR isoforms and orthologs. As described
herein, computer-designed ligands can be generated that distinguish between
binding isoforms and orthologs, thereby allowing the generation of
species-specific, tissue-specific or function-specific ligands. The atomic structural
coordinates disclosed in the present invention reveal structural details unique
to hPXR. These structural details can be exploited when a novel ligand is
designed using the methods of the present invention or other ligand design
methods known in the art. The structural features that differentiate a hPXR
from a mouse PXR and one isoform from another can be targeted in ligand
design. Thus, for example, a ligand can be designed that will recognize a
particular PXR isoform or ortholog, while not interacting with other PXR
isoforms or orthologs, or even with moieties having similar structural features.
Prior to the disclosure of the present invention, a detailed understanding of
the differences between PXR orthoforms and the ability to target a particular
PXR isoform or ortholog were unattainable.

VII.C. Method of Screening for Chemical and Biological Modulators of
the Biological Activity of hPXR

A candidate substance identified according to a screening assay of the 
present invention has an ability to modulate the biological activity of a PXR 
polypeptide or a PXR LBD polypeptide. In a preferred embodiment, such a 
candidate compound can have utility in the treatment of disorders and
conditions associated with the biological activity of a hPXR or a hPXR LBD
polypeptide, including, but not limited to, hPXR and hPXR LBD-based drug- 
drug interactions, hPXR and hPXR LBD-based drug resistance, individualized 
treatment of disease due to polymorphisms, liver cholestasis and other 
degenerative liver disorders. 

In a cell-free system, the method comprises the steps of establishing a
control system comprising a hPXR polypeptide and a ligand which is capable
of binding to the polypeptide; establishing a test system comprising a hPXR
polypeptide, the ligand, and a candidate compound; and determining whether
the candidate compound modulates the activity of the polypeptide by
comparison of the test and control systems. A representative ligand
comprises a fatty acid or other small molecule, and in this embodiment, the
biological activity or property screened includes binding affinity.

In another embodiment of the invention, a form of a hPXR polypeptide 
or a catalytic or immunogenic fragment or oligopeptide thereof, can be used 
for screening libraries of compounds in any of a variety of drug screening 
techniques. The fragment employed in such a screening can be affixed to a
solid support. The formation of binding complexes between a hPXR
polypeptide and the agent being tested will be detected. In a preferred
embodiment, the hPXR polypeptide has an amino acid sequence of SEQ ID
NO: 2. When a hPXR LBD polypeptide is employed, a preferred embodiment
will include a hPXR polypeptide having the amino acid sequence of SEQ ID
NO: 4.

Another technique for drug screening which can be used provides for 
high throughput screening of compounds having suitable binding affinity to the 
protein of interest as described in published PCT application WO 84/03564, 
herein incorporated by reference. In this method, as applied to a polypeptide
of the present invention, large numbers of different small test compounds are
synthesized on a solid substrate, such as plastic pins or some other surface.
The test compounds are reacted with the polypeptide, or fragments thereof.
Bound polypeptide is then detected by methods well known to those of skill in
the art. The polypeptide can also be placed directly onto plates for use in the
aforementioned drug screening techniques.

In yet another embodiment, a method of screening for a modulator of a 
hPXR polypeptide or a hPXR LBD polypeptide comprises: providing a library 
of test samples; contacting a hPXR polypeptide or a hPXR LBD polypeptide 
with each test sample; detecting an interaction between a test sample and a
hPXR polypeptide or a hPXR LBD polypeptide; identifying a test sample that
interacts with a hPXR polypeptide or a hPXR LBD polypeptide; and isolating a
test sample that interacts with a hPXR polypeptide or a hPXR LBD
polypeptide.

In each of the foregoing embodiments, an interaction can be detected 
spectrophotometrically, radiologically or immunologically. An interaction
between a hPXR polypeptide or a hPXR LBD polypeptide and a test sample
can also be quantified using methodology known to those of skill in the art. In
another embodiment, the hPXR polypeptide or the hPXR LBD is in
crystalline form.

In accordance with the present invention there is also provided a rapid 
and high throughput screening method that relies on the methods described
above. This screening method comprises separately contacting each of a
plurality of substantially identical samples with a hPXR polypeptide or a hPXR
LBD and detecting a resulting binding complex. In such a screening method
the plurality of samples preferably comprises more than about 10^4 samples, or
more preferably comprises more than about 5 x 10^4 samples.

VII.D. Method of Identifying Compounds Which Inhibit Ligand Binding

Using the crystal structures and ligand orientations, disclosed for the
first time in the present invention, it is possible to design test compounds that 
inhibit binding of ligands normally bound by a PXR polypeptide. 

In one aspect of the present invention, an assay method for identifying 
a compound that inhibits binding of a ligand to a PXR polypeptide is 
disclosed. A known ligand of hPXR can be used in the assay method as the 
ligand against which the inhibition by a test compound is gauged. SR12813 is 
a preferred ligand in the assay method. The method comprises (a) incubating 
a PXR polypeptide with a ligand in the presence of a test inhibitor compound; 
(b) determining an amount of ligand that is bound to the PXR polypeptide, 
wherein decreased binding of ligand to the PXR polypeptide in the presence 
of the test inhibitor compound relative to binding in the absence of the test 
inhibitor compound is indicative of inhibition; and (c) identifying the test 
compound as an inhibitor of ligand binding if decreased ligand binding is 
observed. Preferably, the ligand is SR12813.

In another aspect of the present invention, the disclosed assay method
can be used in the structural refinement of candidate PXR inhibitors. For
example, multiple rounds of optimization can be followed by gradual structural
changes in a strategy of inhibitor design. A strategy such as this is made
possible by the disclosure of the coordinates of the hPXR LBD and the
disclosure of the orientation of a ligand of PXR, SR12813.
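
The comparison in steps (a) through (c) of the assay method above can be sketched as a simple calculation. The function names, the optional cutoff and the example readings are hypothetical; the text itself only requires that ligand binding decrease in the presence of the test compound.

```python
def percent_inhibition(bound_without, bound_with):
    """Percent decrease in bound ligand caused by the test compound.

    bound_without: ligand bound in the absence of the test compound.
    bound_with: ligand bound in the presence of the test compound.
    Both values share the same arbitrary units (for example, counts per minute).
    """
    if bound_without <= 0:
        raise ValueError("control binding must be positive")
    return 100.0 * (bound_without - bound_with) / bound_without


def is_inhibitor(bound_without, bound_with, min_percent=0.0):
    """Step (c): flag the compound when ligand binding decreases.

    min_percent is a hypothetical cutoff; the text only requires a decrease.
    """
    return percent_inhibition(bound_without, bound_with) > min_percent


# Hypothetical readings for a labeled ligand such as SR12813.
readings = {"compound_A": (1000.0, 250.0), "compound_B": (1000.0, 1020.0)}
for name, (control, test) in readings.items():
    print(name, percent_inhibition(control, test), is_inhibitor(control, test))
```

In an iterative refinement strategy, the same calculation could be repeated after each structural change to rank successive candidates.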

VII.E. Design of PXR Isoform and Ortholog Modulators

The hPXR crystal structure of the present invention can be used to
generate modulators of other PXR isoforms or orthologs, such as mouse PXR
(Swiss-Prot Accession No. O54915). Analysis of the disclosed crystal
structure can provide a guide for designing modulators of PXR isoforms or
orthologs. Purely for purposes of explanation, the development of a mouse
PXR modulator will be considered herein below. It will be apparent to those of
skill in the art, and is explicitly noted here, that the following discussion will be
applicable mutatis mutandis to PXR isoforms and other PXR orthologs,
including rat PXR (Swiss-Prot Accession No. Q9R1A7).

Absent the crystal structure of the present invention, researchers would 
be required to design mouse PXR modulators de novo. The present
invention, however, addresses this problem by providing insights into the
binding cavity of hPXR, which can be extended, due to significant structural
similarity with other PXR isoforms and orthologs, to the binding cavity of, for
example, mouse PXR. An evaluation of the binding cavity of hPXR indicates
that a potential mouse PXR modulator would meet a broad set of general
criteria. Broadly, it can be stated that, based on the crystal structure of hPXR,
a potent mouse PXR ligand would require several general features including: 
(a) a hydrophobic binding cavity; and (b) the ability to adopt a conformation 
that is complementary to the shape of the binding cavity. 

Using the discerned structural similarities and differences between
PXR isoforms and orthologs, as represented and predicted based on the
crystal structure of the present invention and homology models, a mouse PXR
modulator can be designed. For example, based on an evaluation of a
homology model of mouse PXR, which is derived from the hPXR crystal
structure, it is expected that a potent ligand would need characteristics similar
to those listed above for a compound recognized by hPXR. Additional
modifications can be included, based on the disclosed structure, which are
predicted to further define a modulator specific for mouse PXR over other
orthologs. Thus, the disclosed crystal structure of hPXR can be useful when
designing modulators of mouse PXR and other orthologs and isoforms.

An additional aspect of the present invention, then, is to provide a 
technique for predicting the differences in the metabolism of a drug between
mice (or other species of interest) and humans. As described further herein,
the "humanization" of the mouse PXR via four point mutations demonstrates
that the structures disclosed herein can be employed as a predictor of mouse
PXR function. A mouse PXR model, based on the structure of human PXR,
as disclosed herein and co-crystallized and refined with known
pharmacophores, can provide needed insight into PXR activity and
mechanisms. Notably, a "humanized" mouse PXR can provide valuable
insight into the metabolism of drugs in humans. Particularly, the ability to
generate such a system offers the potential to generate PXR drug metabolism
data in a mouse system that is more predictive for human systems than PXR
metabolic data generated based only on a mouse model. A "humanized"
mouse PXR also facilitates insights into existent drug metabolism, and the 
development of human drugs and therapeutics with greater efficacies, using a 
convenient mouse model system. 

VIII. Design, Preparation and Structural Analysis of hPXR and hPXR LBD
Mutants and Structural Equivalents

The present invention provides for the generation of PXR and PXR LBD
mutants (preferably hPXR and hPXR LBD mutants), and the ability to solve
the crystal structures of those that crystallize. More particularly, through the
provision of the three-dimensional structure of a hPXR LBD, desirable sites
for mutation can be identified, based on analysis of the three-dimensional
hPXR LBD structure coordinates provided herein.

The structure coordinates of a hPXR LBD provided in accordance with
the present invention also facilitate the identification of related proteins or
enzymes analogous to hPXR in function, structure or both (for example, a
mouse PXR), which can lead to novel therapeutic modes for treating or
preventing a range of disease states.

VIII.A. Sterically Similar Compounds 

A further aspect of the present invention is that sterically similar 
compounds can be formulated to mimic the key portions of a PXR LBD
structure. Such compounds are functional equivalents. The generation of a
structural functional equivalent can be achieved by the techniques of
modeling and chemical design known to those of skill in the art and described
herein. Modeling and chemical design of PXR and PXR LBD structural
equivalents can be based on the structure coordinates of a crystalline hPXR
LBD polypeptide of the present invention. It will be understood that all such
sterically similar constructs fall within the scope of the present invention. 

VIII.B. PXR Polypeptides

The generation of chimeric PXR polypeptides is also an aspect of the
present invention. Such a chimeric polypeptide can comprise a PXR LBD
polypeptide or a portion of a PXR LBD (e.g., a hPXR LBD), which is fused to a
candidate polypeptide or a suitable region of the candidate polypeptide, for
example a PXR expressed in mouse or other species. Throughout the
present disclosure it is intended that the term "mutant" encompass not only
mutants of a PXR LBD polypeptide but chimeric proteins generated using a
PXR LBD as well. It is thus intended that the following discussion of mutant
PXR LBDs apply mutatis mutandis to chimeric PXR and PXR LBD
polypeptides and to structural equivalents thereof.

In accordance with the present invention, a mutation can be directed to 
a particular site or combination of sites of a wild-type PXR LBD. For example,
an accessory binding site or the binding cavity can be chosen for
mutagenesis. Similarly, a residue having a location on, at or near the surface
of the polypeptide can be replaced, resulting in an altered surface charge of
one or more charge units, as compared to the wild-type PXR and PXR LBD.
Alternatively, an amino acid residue in a PXR or a PXR LBD can be chosen
for replacement based on its hydrophilic or hydrophobic characteristics.

Such mutants can be characterized by any one of several different
properties as compared with the wild-type PXR LBD. For example, such
mutants can have an altered surface charge of one or more charge units, or
can have an increase in overall stability. Other mutants can have altered
substrate specificity in comparison with, or a higher specific activity than, a
wild-type PXR or PXR LBD.

PXR and PXR LBD mutants of the present invention can be generated 
in a number of ways. For example, the wild-type sequence of a PXR or a 
PXR LBD can be mutated at those sites identified using this invention as 
desirable for mutation, by means of oligonucleotide-directed mutagenesis or
other conventional methods, such as deletion. Alternatively, mutants of a
PXR or a PXR LBD can be generated by the site-specific replacement of a
particular amino acid with an unnaturally occurring amino acid. In addition,
PXR or PXR LBD mutants can be generated through replacement of an
amino acid residue, for example, a particular cysteine or methionine residue,
with selenocysteine or selenomethionine. This can be achieved by growing a
host organism capable of expressing either the wild-type or mutant 
polypeptide on a growth medium depleted of either natural cysteine or 
methionine (or both) but enriched in selenocysteine or selenomethionine (or 
both). 

Mutations can be introduced into a DNA sequence coding for a PXR or
a PXR LBD using synthetic oligonucleotides. These oligonucleotides contain
nucleotide sequences flanking the desired mutation sites. Mutations can be 
generated in the full-length DNA sequence of a PXR or a PXR LBD or in any 
sequence coding for polypeptide fragments of a PXR or a PXR LBD. 

According to the present invention, a mutated PXR or PXR LBD DNA
sequence produced by the methods described above, or any alternative
methods known in the art, can be expressed using an expression vector. An
expression vector, as is well known to those of skill in the art, typically
includes elements that permit autonomous replication in a host cell
independent of the host genome, and one or more phenotypic markers for
selection purposes. Either prior to or after insertion of the DNA sequences
surrounding the desired PXR or PXR LBD mutant coding sequence, an
expression vector also will include control sequences encoding a promoter,
operator, ribosome binding site, translation initiation signal, and, optionally, a
repressor gene or various activator genes and a signal for termination. In
some embodiments, where secretion of the produced mutant is desired,
nucleotides encoding a "signal sequence" can be inserted prior to a PXR or a
PXR LBD mutant coding sequence. For expression under the direction of the
control sequences, a desired DNA sequence must be operatively linked to the
control sequences; that is, the sequence must have an appropriate start
signal in front of the DNA sequence encoding the PXR or PXR LBD mutant,
and the correct reading frame to permit expression of that sequence under the
control of the control sequences and production of the desired product
encoded by that PXR or PXR LBD sequence must be maintained.

Any of a wide variety of well-known available expression vectors can 
be useful to express a mutated PXR or PXR LBD coding sequence of this
invention generated as described in Laboratory Example 3. These
expression vectors can be used in the techniques disclosed in Laboratory
Examples 1 and 3 and can include, for example, vectors comprising segments
of chromosomal, non-chromosomal and synthetic DNA sequences, such as
various known derivatives of SV40, known bacterial plasmids, e.g., plasmids
from E. coli including col E1, pCR1, pBR322, pMB9 and their derivatives,
wider host range plasmids, e.g., RP4, phage DNAs, e.g., the numerous
derivatives of phage λ, e.g., NM 989, and other DNA phages, e.g., M13 and
filamentous single stranded DNA phages, yeast plasmids and vectors derived
from combinations of plasmids and phage DNAs, such as plasmids which
have been modified to employ phage DNA or other expression control
sequences. In a preferred embodiment of this invention, the E. coli vector 
pRSETA, including a T7-based expression system, is employed.

In addition, any of a wide variety of expression control sequences
(sequences that control the expression of a DNA sequence when operatively
linked to it) can be used in these vectors to express the mutated DNA
sequences according to this invention. Such useful expression control
sequences include, for example, the early and late promoters of SV40 for
animal cells, the lac system, the trp system, the TAC or TRC system, the
major operator and promoter regions of phage λ, the control regions of fd coat
protein, all for E. coli, the promoter for 3-phosphoglycerate kinase or other
glycolytic enzymes, the promoters of acid phosphatase, e.g., Pho5, the
promoters of the yeast α-mating factors for yeast, and other sequences
known to control the expression of genes of prokaryotic or eukaryotic cells or
their viruses, and various combinations thereof.

A wide variety of hosts are also useful for producing mutated hPXR 
and hPXR LBD polypeptides according to this invention. These hosts include,
for example, bacteria, such as E. coli, Bacillus and Streptomyces, fungi, such
as yeasts, and animal cells, such as CHO and COS-1 cells, plant cells, insect 
cells, such as Sf9 cells, and transgenic host cells. 

It should be understood that not all expression vectors and expression 
systems function in the same way to express mutated DNA sequences of this
invention, and to produce modified PXR and PXR LBD polypeptides or PXR
or PXR LBD mutants. Neither do all hosts function equally well with the same
expression system. One of skill in the art can, however, make a selection
among these vectors, expression control sequences and hosts without undue
experimentation and without departing from the scope of this invention. For
example, an important consideration in selecting a vector will be the ability of
the vector to replicate in a given host. The copy number of the vector, the
ability to control that copy number, and the expression of any other proteins
encoded by the vector, such as antibiotic markers, should also be considered.

In selecting an expression control sequence, a variety of factors should
also be considered. These include, for example, the relative strength of the
system, its controllability and its compatibility with the DNA sequence
encoding a modified PXR or PXR LBD polypeptide of this invention, with
particular regard to the formation of potential secondary and tertiary
structures.

Hosts should be selected by consideration of their compatibility with the 
chosen vector, the toxicity of a modified PXR or PXR LBD to them, their ability
to express mature products, their ability to fold proteins correctly, their
fermentation requirements, the ease of purification of a modified PXR or PXR
LBD and safety. Within these parameters, one of skill in the art can select
various vector/expression control system/host combinations that will produce
useful amounts of a mutant PXR or PXR LBD. A mutant PXR or PXR LBD
produced in these systems can be purified by a variety of conventional steps
and strategies, including those used to purify the wild-type PXR or PXR LBD.

Once a PXR LBD mutation(s) has been generated in the desired
location, such as a ligand binding or dimerization site, the mutants can be
tested for any one of several properties of interest. For example, mutants can
be screened for an altered charge at physiological pH. This is determined by
measuring the mutant PXR or PXR LBD isoelectric point (pI) and comparing
the observed value with that of the wild-type parent. Isoelectric point can be
measured by gel-electrophoresis according to the method of Wellner (Wellner,
(1971) Anal. Chem. 43: 597). A mutant PXR or PXR LBD polypeptide
containing a replacement amino acid located at the surface of the enzyme, as
provided by the structural information of this invention, can lead to an altered
surface charge and an altered pI.
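
The text above measures pI experimentally. As a complementary illustration only, a theoretical pI can be estimated from sequence before a mutant is made, by finding the pH at which the summed Henderson-Hasselbalch charges vanish. The pKa values below are approximate and assumed for this sketch (several published sets exist), and the two peptides are hypothetical toy sequences, not PXR sequences.

```python
# Approximate side-chain and terminal pKa values, assumed for this sketch.
POSITIVE_PKA = {"K": 10.8, "R": 12.5, "H": 6.5}
NEGATIVE_PKA = {"D": 3.9, "E": 4.1, "C": 8.5, "Y": 10.1}
N_TERM_PKA, C_TERM_PKA = 8.6, 3.6


def net_charge(sequence, ph):
    """Net charge of a peptide at a given pH (Henderson-Hasselbalch)."""
    charge = 1.0 / (1.0 + 10 ** (ph - N_TERM_PKA))   # N-terminus
    charge -= 1.0 / (1.0 + 10 ** (C_TERM_PKA - ph))  # C-terminus
    for residue in sequence:
        if residue in POSITIVE_PKA:
            charge += 1.0 / (1.0 + 10 ** (ph - POSITIVE_PKA[residue]))
        elif residue in NEGATIVE_PKA:
            charge -= 1.0 / (1.0 + 10 ** (NEGATIVE_PKA[residue] - ph))
    return charge


def isoelectric_point(sequence, tolerance=1e-4):
    """Find the pH of zero net charge by bisection between pH 0 and 14."""
    low, high = 0.0, 14.0
    while high - low > tolerance:
        mid = (low + high) / 2.0
        if net_charge(sequence, mid) > 0:
            low = mid   # still positive: the pI lies at higher pH
        else:
            high = mid
    return (low + high) / 2.0


# Hypothetical peptides: the "mutant" replaces a surface Glu (E) with Lys (K).
wild_type = "MEALEDKR"
mutant = "MKALEDKR"
print(round(isoelectric_point(wild_type), 2), round(isoelectric_point(mutant), 2))
```

Replacing an acidic residue with a basic one raises the estimated pI, which is the direction of shift that the gel-based comparison with the wild-type parent would be expected to confirm.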

VIII.C. Generation of an Engineered PXR or PXR LBD Mutant

In another aspect of the present invention, a unique PXR or PXR LBD
polypeptide can be generated. Such a mutant can facilitate purification and
can facilitate the study of the ligand binding abilities of a PXR polypeptide. 

As used in the following discussion, the terms "engineered PXR",
"engineered PXR LBD", "PXR mutant", and "PXR LBD mutant" refer to
polypeptides having amino acid sequences which contain at least one
mutation in the wild-type sequence. The terms also refer to PXR and PXR
LBD polypeptides which are capable of exerting a biological effect in that they
comprise all or a part of the amino acid sequence of an engineered PXR or
PXR LBD mutant polypeptide of the present invention, or cross-react with
antibodies raised against an engineered PXR or PXR LBD mutant
polypeptide, or retain all or some or an enhanced degree of the biological
activity of the engineered PXR or PXR LBD mutant amino acid sequence or
protein. Such biological activity can include lipid binding in general, and fatty
acid binding in particular.

The terms "engineered PXR LBD" and "PXR LBD mutant" also include
analogs of an engineered PXR LBD or PXR LBD mutant polypeptide. By
"analog" is intended that a DNA or polypeptide sequence can contain
alterations relative to the sequences disclosed herein, yet retain all or some or
an enhanced degree of the biological activity of those sequences. Analogs
can be derived from genomic nucleotide sequences or from other organisms,
or can be created synthetically. Those of skill in the art will appreciate that
other analogs, as yet undisclosed or undiscovered, can be used to design
and/or construct PXR LBD or PXR LBD mutant analogs. There is no need for
an engineered PXR LBD or PXR LBD mutant polypeptide to comprise all or
substantially all of the amino acid sequence of SEQ ID NOs: 2 or 4. Shorter
or longer sequences are anticipated to be of use in the invention; shorter
sequences are herein referred to as "segments". Thus, the terms "engineered
PXR LBD" and "PXR LBD mutant" also include fusion, chimeric or
recombinant engineered PXR LBD or PXR LBD mutant polypeptides and
proteins comprising sequences of the present invention. Methods of
preparing such proteins are disclosed herein above and are known in the art.


VIII.D. Sequence Similarity and Identity

As used herein, the term "substantially similar" means that a particular
sequence varies from the nucleic acid sequence of SEQ ID NOs: 1 or 3, or the
amino acid sequence of SEQ ID NOs: 2 or 4 by one or more deletions,
substitutions, or additions, the net effect of which is to retain at least some of
the biological activity of the natural gene, gene product, or sequence. Such
sequences include "mutant" or "polymorphic" sequences, or sequences in
which the biological activity and/or the physical properties are altered to some
degree but which retain at least some or an enhanced degree of the original
biological activity and/or physical properties. In determining nucleic acid
sequences, all subject nucleic acid sequences capable of encoding
substantially similar amino acid sequences are considered to be substantially
similar to a reference nucleic acid sequence, regardless of differences in
codon sequences or substitution of equivalent amino acids to create
biologically functional equivalents.

VIII.D.1. Sequences That are Substantially Identical to an
Engineered PXR or PXR LBD Mutant Sequence of the
Present Invention

Nucleic acids that are substantially identical to a nucleic acid sequence
of an engineered PXR or PXR LBD mutant of the present invention, e.g. allelic
variants, genetically altered versions of the gene, etc., bind to an engineered
PXR or PXR LBD mutant sequence under stringent hybridization conditions.
By using probes, particularly labeled probes of DNA sequences, one can
isolate homologous or related genes. The source of homologous genes can
be any species, e.g. primate species; rodents, such as rats and mice;
canines, felines, bovines, equines, yeast, nematodes, etc.

Between mammalian species, e.g. human and mouse, homologs have
substantial sequence similarity, i.e. at least 75% sequence identity between
nucleotide sequences. Sequence similarity is calculated based on a
reference sequence, which can be a subset of a larger sequence, such as a
conserved motif, coding region, flanking region, etc. A reference sequence
will usually be at least about 18 nucleotides (nt) long, more usually at least
about 30 nt long, and can extend to the complete sequence that is being
compared. Algorithms for sequence analysis are known in the art, such as
BLAST, described in Altschul et al., (1990) J. Mol. Biol. 215: 403-10.

Percent identity or percent similarity of a DNA or peptide sequence can
be determined, for example, by comparing sequence information using the
GAP computer program, available from the University of Wisconsin Genetics
Computer Group. The GAP program utilizes the alignment method of
Needleman et al., (1970) J. Mol. Biol. 48: 443, as revised by Smith et al.,
(1981) Adv. Appl. Math. 2:482. Briefly, the GAP program defines similarity as
the number of aligned symbols (i.e., nucleotides or amino acids) that are
similar, divided by the total number of symbols in the shorter of the two
sequences. The preferred parameters for the GAP program are the default
parameters, which do not impose a penalty for end gaps. See, e.g., Schwartz
et al., eds., (1979), Atlas of Protein Sequence and Structure, National
Biomedical Research Foundation, pp. 357-358, and Gribskov et al., (1986)
Nucl. Acids. Res. 14: 6745.
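
The similarity definition given above (aligned similar symbols divided by
the number of symbols in the shorter sequence) can be sketched in Python as
follows. This is an illustrative sketch only and is not the GAP program: it
assumes the two sequences have already been aligned, with "-" marking gaps,
and it treats only the leucine/isoleucine and glutamate/aspartate pairs as
"similar". The function name gap_style_scores and the example sequences are
hypothetical.

```python
# Pairs treated as "similar" for this example only (an assumption).
SIMILAR_PAIRS = {frozenset("LI"), frozenset("ED")}


def gap_style_scores(aligned_a, aligned_b):
    """Return (identity, similarity) for two pre-aligned sequences.

    Both fractions use the length of the shorter ungapped sequence as the
    denominator, following the definition quoted in the text.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    shorter = min(len(aligned_a.replace("-", "")),
                  len(aligned_b.replace("-", "")))
    if shorter == 0:
        raise ValueError("sequences must contain at least one symbol")
    identical = 0
    similar = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" or b == "-":
            continue  # gap positions contribute nothing
        if a == b:
            identical += 1
            similar += 1  # identical symbols also count as similar
        elif frozenset((a, b)) in SIMILAR_PAIRS:
            similar += 1
    return identical / shorter, similar / shorter


# Hypothetical aligned peptides.
identity, similarity = gap_style_scores("MLEDK-A", "MIDDKRA")
```

In this example four of the six symbols of the shorter sequence are
identical and the remaining two are conservative replacements, so the
similarity fraction exceeds the identity fraction.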

The term "similarity" is contrasted with the term "identity". Similarity is
defined as above; "identity", however, means a nucleic acid or amino acid
sequence having the same amino acid at the same relative position in a given
family member of a gene family. Homology and similarity are generally
viewed as broader terms than the term identity. Biochemically similar amino
acids, for example leucine/isoleucine or glutamate/aspartate, can be present
at the same position; these are not identical per se, but are biochemically
"similar." As disclosed herein, these are referred to as conservative
differences or conservative substitutions. This differs from a conservative
mutation at the DNA level, which changes the nucleotide sequence without
making a change in the encoded amino acid, e.g. TCC to TCA, both of which
encode serine.
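
The distinction drawn above between a conservative mutation at the DNA
level and a conservative substitution at the protein level can be
illustrated with a short Python sketch. It assumes the standard genetic
code written in the DNA alphabet, and it treats only the two amino acid
pairs named above as conservative; the function name classify_codon_change
is hypothetical.

```python
from itertools import product

# Standard genetic code, bases ordered T, C, A, G ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): amino_acid
               for codon, amino_acid in zip(product(BASES, repeat=3),
                                            AMINO_ACIDS)}

# Only the pairs named in the text; a fuller table is a design choice.
CONSERVATIVE_PAIRS = {frozenset("LI"), frozenset("ED")}


def classify_codon_change(original_codon, new_codon):
    """Classify a single-codon change by its effect on the encoded residue."""
    original_aa = CODON_TABLE[original_codon.upper()]
    new_aa = CODON_TABLE[new_codon.upper()]
    if original_aa == new_aa:
        return "silent"  # DNA changes, encoded amino acid does not
    if frozenset((original_aa, new_aa)) in CONSERVATIVE_PAIRS:
        return "conservative substitution"
    return "non-conservative substitution"


result = classify_codon_change("TCC", "TCA")  # both codons encode serine
```

Under these assumptions the TCC to TCA change is classified as silent,
whereas a change from CTG (leucine) to ATT (isoleucine) is classified as a
conservative substitution.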

As used herein, DNA analog sequences are "substantially identical" to
specific DNA sequences disclosed herein if: (a) the DNA analog sequence is
derived from coding regions of the nucleic acid sequence shown in SEQ ID
NOs: 1 or 3; or (b) the DNA analog sequence is capable of hybridization with
DNA sequences of (a) under stringent conditions and which encode a
biologically active hPXR or hPXR LBD gene product; or (c) the DNA
sequences are degenerate as a result of alternative genetic code to the DNA
analog sequences defined in (a) and/or (b). Substantially identical analog
proteins and nucleic acids will have between about 70% and 80%, preferably
between about 81% and about 90% or even more preferably between about
91% and 99% sequence identity with the corresponding sequence of the
native protein or nucleic acid. Sequences having lesser degrees of identity
but comparable biological activity are considered to be equivalents.

As used herein, "stringent conditions" means conditions of high
stringency, for example 6X SSC, 0.2% polyvinylpyrrolidone, 0.2% Ficoll, 0.2%
bovine serum albumin, 0.1% sodium dodecyl sulfate, 100 µg/ml salmon sperm
DNA and 15% formamide at 68°C. For the purposes of specifying additional
conditions of high stringency, preferred conditions are salt concentration of
about 200 mM and temperature of about 45°C. One example of such
stringent conditions is hybridization at 4X SSC, at 65°C, followed by a
washing in 0.1X SSC at 65°C for one hour. Another exemplary stringent
hybridization scheme uses 50% formamide, 4X SSC at 42°C.

In contrast, nucleic acids having sequence similarity are detected by
hybridization under lower stringency conditions. Thus, sequence identity can
be determined by hybridization under lower stringency conditions, for
example, at 50°C or higher and 0.1X SSC (9 mM NaCl/0.9 mM sodium citrate)
and the sequences will remain bound when subjected to washing at 55°C in
1X SSC.



VIII.D.2. Complementarity and Hybridization to an Engineered
PXR or PXR LBD Mutant Sequence

As used herein, the term "complementary sequences" means nucleic
acid sequences which are base-paired according to the standard
Watson-Crick complementarity rules. The present invention also encompasses
the use of nucleotide segments that are complementary to the sequences of the
present invention.
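
As a minimal illustration of the Watson-Crick complementarity rules
referred to above, the following Python sketch derives the complementary
strand of a DNA segment and tests whether two segments are fully
complementary. The function names and the example sequence are hypothetical
and are given for illustration only.

```python
# Watson-Crick pairing for DNA: A pairs with T, and G pairs with C.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence):
    """Return the complementary strand, written 5' to 3'."""
    return sequence.translate(COMPLEMENT)[::-1]


def are_complementary(strand_a, strand_b):
    """True if the two strands base-pair along their full length."""
    return reverse_complement(strand_a).upper() == strand_b.upper()


probe = "ATGGCCTTA"                  # hypothetical segment
target = reverse_complement(probe)   # "TAAGGCCAT"
```

Because the two strands of a duplex are antiparallel, the complement is
reversed so that both sequences read in the 5' to 3' direction.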

Hybridization can also be used for assessing complementary
sequences and/or isolating complementary nucleotide sequences. As
discussed above, nucleic acid hybridization will be affected by such conditions
as salt concentration, temperature, or organic solvents, in addition to the base
composition, length of the complementary strands, and the number of
nucleotide base mismatches between the hybridizing nucleic acids, as will be
readily appreciated by those skilled in the art. Stringent temperature
conditions will generally include temperatures in excess of about 30°C,
typically in excess of about 37°C, and preferably in excess of about 45°C.
Stringent salt conditions will ordinarily be less than about 1,000 mM, typically
less than about 500 mM, and preferably less than about 200 mM. However,
the combination of parameters is much more important than the measure of
any single parameter. See, e.g., Wetmur & Davidson, (1968) J. Mol. Biol. 31:
349-70. Determining appropriate hybridization conditions to identify and/or
isolate sequences containing high levels of homology is well known in the art.
See, e.g., Sambrook et al., (1989) Molecular Cloning: A Laboratory Manual,
Cold Spring Harbor, New York.

VIII.D.3. Functional Equivalents of an Engineered PXR or PXR
LBD Mutant Nucleic Acid Sequence of the Present
Invention

As used herein, the term "functionally equivalent codon" is used to refer
to codons that encode the same amino acid, such as the AGC and AGU
codons for serine. Human PXR or hPXR LBD-encoding nucleic acid
sequences comprising SEQ ID NOs: 1 and 3, which have functionally
equivalent codons, are covered by the present invention. Thus, when
referring to the sequence example presented in SEQ ID NOs: 1 and 3,
applicants contemplate substitution of functionally equivalent codons into the
sequence example of SEQ ID NOs: 1 and 3. Thus, applicants are in
possession of amino acid and nucleic acid sequences which include such
substitutions but which are not set forth herein in their entirety for
convenience.
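
To indicate why such sequences are not set forth in their entirety, the
number of coding sequences that differ only in functionally equivalent
codons can be counted. The Python sketch below assumes the standard genetic
code written in the DNA alphabet (T in place of U); the function names and
the example peptide are hypothetical.

```python
from itertools import product
from math import prod

# Standard genetic code, bases ordered T, C, A, G ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): amino_acid
               for codon, amino_acid in zip(product(BASES, repeat=3),
                                            AMINO_ACIDS)}


def equivalent_codons(amino_acid):
    """All codons that encode the given one-letter amino acid."""
    return sorted(codon for codon, encoded in CODON_TABLE.items()
                  if encoded == amino_acid)


def translate(coding_sequence):
    """Translate a DNA coding sequence, ignoring any trailing partial codon."""
    usable = len(coding_sequence) - len(coding_sequence) % 3
    return "".join(CODON_TABLE[coding_sequence[i:i + 3]]
                   for i in range(0, usable, 3))


def count_equivalent_coding_sequences(peptide):
    """Number of distinct DNA sequences that encode the same peptide."""
    return prod(len(equivalent_codons(residue)) for residue in peptide)


serine_codons = equivalent_codons("S")
variants = count_equivalent_coding_sequences("MSR")  # Met-Ser-Arg
```

Even for a three-residue peptide such as Met-Ser-Arg there are 36
equivalent coding sequences (1 x 6 x 6), and the number grows
multiplicatively with length.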

It will also be understood by those of skill in the art that amino acid and
nucleic acid sequences can include additional residues, such as additional N-
or C-terminal amino acids or 5' or 3' nucleic acid sequences, and yet still be
essentially as set forth in one of the sequences disclosed herein, so long as
the sequence retains biological protein activity where polypeptide expression
is concerned. The addition of terminal sequences particularly applies to
nucleic acid sequences which can, for example, include various non-coding
sequences flanking either of the 5' or 3' portions of the coding region or can
include various internal sequences, i.e., introns, which are known to occur
within genes.

VIII.D.4. Biological Equivalents

The present invention envisions and includes biological equivalents of
an engineered PXR or PXR LBD mutant polypeptide of the present invention.
The term "biological equivalent" refers to proteins having amino acid
sequences which are substantially identical to the amino acid sequence of an
engineered PXR LBD mutant of the present invention and which are capable
of exerting a biological effect in that they are capable of binding DNA moieties
or cross-reacting with anti-PXR or PXR LBD mutant antibodies raised against
an engineered mutant PXR or PXR LBD polypeptide of the present invention.

For example, certain amino acids can be substituted for other amino
acids in a protein structure without appreciable loss of interactive capacity
with, for example, structures in the nucleus of a cell. Since it is the interactive
capacity and nature of a protein that defines that protein's biological functional
activity, certain amino acid sequence substitutions can be made in a protein
sequence (or the nucleic acid sequence encoding it) to obtain a protein with
the same, enhanced, or antagonistic properties. Such properties can be
achieved by interaction with the normal targets of the protein, but this need
not be the case, and the biological activity of the invention is not limited to a
particular mechanism of action. It is thus in accordance with the present
invention that various changes can be made in the amino acid sequence of an
engineered PXR or PXR LBD mutant polypeptide of the present invention or
its underlying nucleic acid sequence without appreciable loss of biological
utility or activity.

Biologically equivalent polypeptides, as used herein, are polypeptides
in which certain, but not most or all, of the amino acids can be substituted.
Thus, when referring to the sequence examples presented in SEQ ID NOs: 1
and 3, applicants envision substitution of codons that encode biologically
equivalent amino acids, as described herein, into the sequence example of
SEQ ID NOs: 1 and 3, respectively. Thus, applicants are in possession of
amino acid and nucleic acid sequences which include such substitutions but
which are not set forth herein in their entirety for convenience.

Alternatively, functionally equivalent proteins or peptides can be
created via the application of recombinant DNA technology, in which changes
in the protein structure can be engineered, based on considerations of the
properties of the amino acids being exchanged, e.g. substitution of Ile for Leu.
Changes designed by man can be introduced through the application of
site-directed mutagenesis techniques, e.g., to introduce improvements to the
antigenicity of the protein or to test an engineered PXR or PXR LBD mutant
polypeptide of the present invention in order to modulate DNA-binding,
lipid-binding or other activity, at the molecular level.

Amino acid substitutions, such as those which might be employed in
modifying an engineered PXR or PXR LBD mutant polypeptide of the present
invention are generally, but not necessarily, based on the relative similarity of
the amino acid side-chain substituents, for example, their hydrophobicity,
hydrophilicity, charge, size, and the like. An analysis of the size, shape and
type of the amino acid side-chain substituents reveals that arginine, lysine and
histidine are all positively charged residues; that alanine, glycine and serine
are all of similar size; and that phenylalanine, tryptophan and tyrosine all have
a generally similar shape. Therefore, based upon these considerations,
arginine, lysine and histidine; alanine, glycine and serine; and phenylalanine,
tryptophan and tyrosine; are defined herein as biologically functional
equivalents. Other biologically functionally equivalent changes will be
appreciated by those of skill in the art. It is implicit in the above discussion,
however, that one of skill in the art can appreciate that a radical, rather than a
conservative substitution is warranted in a given situation. Non-conservative
substitutions in engineered mutant PXR or PXR LBD polypeptides of the
present invention are also an aspect of the present invention.

In making biologically functional equivalent amino acid substitutions,
the hydropathic index of amino acids can be considered. Each amino acid
has been assigned a hydropathic index on the basis of its hydrophobicity
and charge characteristics. These are: isoleucine (+4.5); valine (+4.2);
leucine (+3.8); phenylalanine (+2.8); cysteine (+2.5); methionine (+1.9);
alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); tryptophan (-0.9);
tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); glutamine
(-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5).

The importance of the hydropathic amino acid index in conferring
interactive biological function on a protein is generally understood in the art
(Kyte & Doolittle, (1982), J. Mol. Biol. 157: 105-132, incorporated herein by
reference). It is known that certain amino acids can be substituted for other
amino acids having a similar hydropathic index or score and still retain a
similar biological activity. In making changes based upon the hydropathic
index, the substitution of amino acids whose hydropathic indices are within ±2
of the original value is preferred, those which are within ±1 of the original
value are particularly preferred, and those within ±0.5 of the original value are
even more particularly preferred.
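
The preference tiers described above can be expressed as a simple lookup.
The following Python sketch uses the hydropathic index values recited
above; the function name preference_tier and the tier labels returned are
illustrative choices rather than part of the disclosure.

```python
# Hydropathic indices as recited above (Kyte & Doolittle), one-letter codes.
HYDROPATHY = {
    "I": 4.5, "V": 4.2, "L": 3.8, "F": 2.8, "C": 2.5, "M": 1.9, "A": 1.8,
    "G": -0.4, "T": -0.7, "S": -0.8, "W": -0.9, "Y": -1.3, "P": -1.6,
    "H": -3.2, "E": -3.5, "Q": -3.5, "D": -3.5, "N": -3.5, "K": -3.9,
    "R": -4.5,
}


def preference_tier(original, replacement):
    """Rank a substitution by the difference in hydropathic index."""
    # Round to one decimal so binary floating-point noise cannot push a
    # difference across a tier boundary.
    delta = round(abs(HYDROPATHY[original] - HYDROPATHY[replacement]), 1)
    if delta <= 0.5:
        return "even more particularly preferred"
    if delta <= 1.0:
        return "particularly preferred"
    if delta <= 2.0:
        return "preferred"
    return "outside the preferred range"


tier = preference_tier("I", "L")  # difference of 0.7
```

For example, an isoleucine to leucine replacement differs by 0.7 and falls
within the ±1 tier, whereas isoleucine to arginine differs by 9.0 and falls
outside all three tiers.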

It is also understood in the art that the substitution of like amino acids
can be made effectively on the basis of hydrophilicity. U.S. Patent No.
4,554,101, incorporated herein by reference, states that the greatest local
average hydrophilicity of a protein, as governed by the hydrophilicity of its
adjacent amino acids, correlates with its immunogenicity and antigenicity, i.e.
with a biological property of the protein. It is understood that an amino acid
can be substituted for another having a similar hydrophilicity value and still
obtain a biologically equivalent protein.

As detailed in U.S. Patent No. 4,554,101, the following hydrophilicity
values have been assigned to amino acid residues: arginine (+3.0); lysine
(+3.0); aspartate (+3.0±1); glutamate (+3.0±1); serine (+0.3); asparagine
(+0.2); glutamine (+0.2); glycine (0); threonine (-0.4); proline (-0.5±1); alanine
(-0.5); histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (-1.5); leucine
(-1.8); isoleucine (-1.8); tyrosine (-2.3); phenylalanine (-2.5); tryptophan (-3.4).

In making changes based upon similar hydrophilicity values, the
substitution of amino acids whose hydrophilicity values are within ±2 of the
original value is preferred, those which are within ±1 of the original value are
particularly preferred, and those within ±0.5 of the original value are even
more particularly preferred.
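
The notion of a "greatest local average hydrophilicity" noted above can be
illustrated with a sliding-window average over the hydrophilicity values
recited above. The Python sketch below is illustrative only: the window
length of six residues is an assumption made for the example, the central
values are used for residues recited with a ±1 range, and the function
names and test sequence are hypothetical.

```python
# Hydrophilicity values as recited above; central values are used where a
# +/-1 range is given (aspartate, glutamate, proline).
HYDROPHILICITY = {
    "R": 3.0, "K": 3.0, "D": 3.0, "E": 3.0, "S": 0.3, "N": 0.2, "Q": 0.2,
    "G": 0.0, "T": -0.4, "P": -0.5, "A": -0.5, "H": -0.5, "C": -1.0,
    "M": -1.3, "V": -1.5, "L": -1.8, "I": -1.8, "Y": -2.3, "F": -2.5,
    "W": -3.4,
}


def local_average_hydrophilicity(sequence, window=6):
    """Average hydrophilicity of every window of consecutive residues."""
    if window < 1 or len(sequence) < window:
        raise ValueError("window must be between 1 and the sequence length")
    return [sum(HYDROPHILICITY[residue]
                for residue in sequence[start:start + window]) / window
            for start in range(len(sequence) - window + 1)]


def most_hydrophilic_window(sequence, window=6):
    """Return (start index, segment, average) for the highest-scoring window."""
    averages = local_average_hydrophilicity(sequence, window)
    start = max(range(len(averages)), key=averages.__getitem__)
    return start, sequence[start:start + window], averages[start]


# Hypothetical sequence with a charged stretch flanked by apolar residues.
start, segment, average = most_hydrophilic_window("LLVIFWKRDEKRLLVI")
```

In this example the charged stretch is selected as the most hydrophilic
window. Replacing residues in such a segment with residues of similar
hydrophilicity would be expected to leave the local average largely
unchanged.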

While discussion has focused on functionally equivalent polypeptides
arising from amino acid changes, it will be appreciated that these changes
can be effected by alteration of the encoding DNA, taking into consideration
also that the genetic code is degenerate and that two or more codons can
code for the same amino acid.

Thus, it will also be understood that this invention is not limited to the
particular amino acid and nucleic acid sequences of SEQ ID NOs: 1-4.
Recombinant vectors and isolated DNA segments can therefore variously
include an engineered hPXR or hPXR LBD mutant polypeptide-encoding
region itself, include coding regions bearing selected alterations or
modifications in the basic coding region, or include larger polypeptides which
nevertheless comprise a hPXR or hPXR LBD mutant polypeptide-encoding
region or can encode biologically functional equivalent proteins or
polypeptides which have variant amino acid sequences. Biological activity of
an engineered hPXR or hPXR LBD mutant polypeptide can be determined, for
example, by lipid-binding assays known to those of skill in the art.

The nucleic acid segments of the present invention, regardless of the
length of the coding sequence itself, can be combined with other DNA
sequences, such as promoters, enhancers, polyadenylation signals, additional
restriction enzyme sites, multiple cloning sites, other coding segments, and
the like, such that their overall length can vary considerably. It is therefore
contemplated that a nucleic acid fragment of almost any length can be
employed, with the total length preferably being limited by the ease of
preparation and use in the intended recombinant DNA protocol. For example,
nucleic acid fragments can be prepared which include a short stretch
complementary to a nucleic acid sequence set forth in SEQ ID NOs: 1 and 3,
such as about 10 nucleotides, and which are up to 10,000 or 5,000 base pairs
in length. DNA segments with total lengths of about 4,000, 3,000, 2,000,
1,000, 500, 200, 100, and about 50 base pairs in length are also useful.

The DNA segments of the present invention encompass biologically
functional equivalents of engineered PXR or PXR LBD mutant polypeptides.
Such sequences can arise as a consequence of codon redundancy and
functional equivalency that are known to occur naturally within nucleic acid
sequences and the proteins thus encoded. Alternatively, functionally
equivalent proteins or polypeptides can be created via the application of
recombinant DNA technology, in which changes in the protein structure can
be engineered, based on considerations of the properties of the amino acids
being exchanged. Changes can be introduced through the application of
site-directed mutagenesis techniques, e.g., to introduce improvements to the
antigenicity of the protein or to test variants of an engineered PXR or PXR
LBD mutant of the present invention in order to examine the degree of
lipid-binding activity, or other activity at the molecular level. Various
site-directed mutagenesis techniques are known to those of skill in the art
and can be employed in the present invention.

The invention further encompasses fusion proteins and peptides
wherein an engineered PXR or PXR LBD mutant coding region of the present
invention is aligned within the same expression unit with other proteins or
peptides having desired functions, such as for purification or immunodetection
purposes.

Recombinant vectors form important further aspects of the present
invention. Particularly useful vectors are those in which the coding portion of
the DNA segment is positioned under the control of a promoter. The promoter
can be that naturally associated with a PXR gene, as can be obtained by
isolating the 5' non-coding sequences located upstream of the coding
segment or exon, for example, using recombinant cloning and/or PCR
technology and/or other methods known in the art, in conjunction with the
compositions disclosed herein.

In other embodiments, certain advantages will be gained by positioning
the coding DNA segment under the control of a recombinant, or heterologous,
promoter. As used herein, a recombinant or heterologous promoter is a
promoter that is not normally associated with a PXR gene in its natural
environment. Such promoters can include promoters isolated from bacterial,
viral, eukaryotic, or mammalian cells. Naturally, it will be important to employ
a promoter that effectively directs the expression of the DNA segment in the
cell type chosen for expression. The use of promoter and cell type
combinations for protein expression is generally known to those of skill in the
art of molecular biology (See, e.g., Sambrook et al., (1989) Molecular Cloning:
A Laboratory Manual, Cold Spring Harbor Laboratory, New York, specifically
incorporated herein by reference). The promoters employed can be
constitutive or inducible and can be used under the appropriate conditions to
direct high level expression of the introduced DNA segment, such as is
advantageous in the large-scale production of recombinant proteins or
peptides. One preferred promoter system contemplated for use in high-level
expression is a T7 promoter-based system.

IX. Uses of PXR Mutants

The PXR mutants disclosed herein have a variety of applications,
including in the screening of components for PXR/SXR activation using the
cell-free reporter gene assay methods disclosed herein above, and using
whole animal models. The PXR mutants can also be used in cell-free,
cell-based and whole animal assay methods for bioavailability of compounds
and for toxicology analysis. Additionally, PXR mutants can be employed in
crystallizations, screening for changes in ligand activation, screening for
species-specific changes in ligand activation and screening for changes in
oligomerization state both with and without ligand.

IX.A. Evaluating a Candidate Therapeutic Agent in Humans Using a
Mouse Model System

A mutant PXR comprising a "humanized" mouse can find particular
utility as a model for the study of drug metabolism in humans. Such a model
can be more informative regarding drug metabolism in humans than studies
using unmodified mouse PXR as a predictive model for human drug
metabolism. It also offers the convenience and advantages of employing a
mouse model system to study a human protein.

In a preferred embodiment, a method of evaluating a candidate
therapeutic agent in humans using a mouse model system comprises the
following steps. Atomic coordinates of a human PXR ligand binding domain
are provided to a computerized modeling system. Preferred coordinates are
supplied in Table 4 of the present disclosure. Based on the human PXR
atomic coordinates, a candidate therapeutic agent that fits spatially into a
binding cavity or on the surface of a human PXR ligand binding domain is
modeled. Modeling of a candidate therapeutic agent can conveniently employ
a computer-based approach. Computer programs, such as the INSIGHT II®
program (Molecular Simulations, Inc., San Diego, California) can be employed
to create three-dimensional structures of candidate agents. After a candidate
therapeutic agent structure is generated, a computer program (which can
comprise the appropriate design and structure evaluation modules) can be
employed to preliminarily evaluate the candidate agent for use as a
therapeutic. The candidate agent can be docked with a site on the PXR, for
example, and the efficiency of the interaction can be computationally
evaluated for various physical properties, such as steric considerations and
efficiency of binding.

A mouse PXR polypeptide is provided, and one or more mutations to
be introduced into the mouse PXR amino acid sequence of the mouse PXR
polypeptide are selected. The mutations are preferably selected so as to alter
the mouse PXR polypeptide to be similar to a human PXR polypeptide.
Although any mutations can be selected, preferred mutations are selected
from the group consisting of an arginine to leucine substitution at residue 203
of the mouse PXR polypeptide, a proline to serine substitution at residue 205
of the mouse PXR polypeptide, a glutamine to histidine substitution at residue
404 of the mouse PXR polypeptide and a glutamine to arginine substitution at
residue 407 of the mouse PXR polypeptide.
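
For planning purposes, the selected substitutions can be written in
conventional shorthand (e.g., R203L for arginine to leucine at residue 203)
and applied to a sequence in silico before any mutagenesis is performed. The
Python sketch below is illustrative only. The placeholder sequence is not
the mouse PXR sequence: it is an arbitrary string constructed so that the
four positions carry the wild-type residues named above, and its length and
the function names are hypothetical.

```python
import re

# Preferred substitutions named above, in one-letter shorthand.
HUMANIZING_MUTATIONS = ["R203L", "P205S", "Q404H", "Q407R"]
MUTATION_PATTERN = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def apply_point_mutations(sequence, mutations):
    """Apply substitutions such as 'R203L' (1-based residue numbering).

    Each mutation is checked against the expected wild-type residue so
    that a numbering mismatch is reported rather than silently applied.
    """
    residues = list(sequence)
    for mutation in mutations:
        match = MUTATION_PATTERN.match(mutation)
        if match is None:
            raise ValueError(f"unrecognized mutation: {mutation!r}")
        wild_type, replacement = match.group(1), match.group(3)
        position = int(match.group(2))
        if position < 1 or position > len(residues):
            raise ValueError(f"{mutation}: position outside the sequence")
        if residues[position - 1] != wild_type:
            raise ValueError(
                f"{mutation}: found {residues[position - 1]} at {position}")
        residues[position - 1] = replacement
    return "".join(residues)


def placeholder_sequence(length=410):
    """Arbitrary stand-in sequence; NOT the real mouse PXR sequence."""
    residues = ["A"] * length
    for position, residue in {203: "R", 205: "P", 404: "Q", 407: "Q"}.items():
        residues[position - 1] = residue
    return "".join(residues)


wild_type = placeholder_sequence()
humanized = apply_point_mutations(wild_type, HUMANIZING_MUTATIONS)
```

Checking the wild-type residue before each replacement guards against
applying a mutation to a construct whose numbering differs from that of the
full-length polypeptide, for example a truncated LBD construct.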

The selected one or more mutations is/are introduced into a mouse
PXR polypeptide and a mutant mouse PXR polypeptide comprising the
selected one or more mutations is provided. The mutant mouse
PXR polypeptide is expressed and purified using methods that are disclosed
herein and will be apparent to those of skill in the art, upon consideration of
the present disclosure.

The effect of the candidate therapeutic agent on the mutant mouse
PXR polypeptide is determined. The effect can be determined by employing
any appropriate reporter system. For example, the biological activity of a PXR
polypeptide can be employed to determine an interaction.

The potential of the candidate therapeutic agent for use in humans is
evaluated based on the effect of the candidate therapeutic agent on the
mutant mouse PXR polypeptide. Thus, in one aspect of the present invention,
the suitability of a candidate therapeutic for human disorders and conditions is
conveniently assessed in a mouse model system. This system offers the
ability to assess the biological response of a candidate therapeutic in vivo,
which can add an additional degree of confidence to a conclusion made
regarding the candidate therapeutic.

X. The Role of the Three-Dimensional Structure of the hPXR LBD in
Solving Additional PXR Crystals

Because polypeptides can crystallize in more than one crystal form, the
structural coordinates of a hPXR LBD, or portions thereof, as provided by the
present invention, are particularly useful in solving the structure of other
crystal forms of hPXR and the crystalline forms of other PXRs. The
coordinates provided in the present invention can also be used to solve the
structure of PXR or PXR LBD mutants (such as those described in Section
VIII above), PXR LBD co-complexes, or of the crystalline form of any other
protein with significant amino acid sequence homology to any functional
domain of PXR.

X.A. Determining the Three-Dimensional Structure of a Polypeptide
Using the Three-Dimensional Structure of the hPXR LBD as a
Template in Molecular Replacement

One method that can be employed for the purpose of solving additional
PXR crystal structures is molecular replacement. See generally, Rossmann,
ed., (1972) The Molecular Replacement Method, Gordon & Breach, New York.
In the molecular replacement method, the unknown crystal structure, whether
it is another crystal form of a PXR or a PXR LBD (i.e. a PXR or a PXR LBD
mutant), or a PXR or a PXR LBD polypeptide complexed with another
compound (a "co-complex"), or the crystal of some other protein with
significant amino acid sequence homology to any functional region of the
hPXR LBD, can be determined using the hPXR LBD structure coordinates
provided in Table 4. This method provides an accurate structural form for the
unknown crystal more quickly and efficiently than attempting to determine
such information ab initio.

In addition, in accordance with this invention, PXR or PXR mutants
(preferably hPXR or hPXR LBD mutants) can be crystallized in complex with
known modulators. The crystal structures of a series of such complexes can
then be solved by molecular replacement and compared with that of wild-type
hPXR or the wild-type hPXR LBD. Potential sites for modification within the
various binding sites of the enzyme can thus be identified. This information
provides an additional tool for determining the most efficient binding
interactions, for example, increased hydrophobic interactions, between the
hPXR LBD and a chemical entity or compound.

All of the complexes referred to in the present disclosure can be
studied using X-ray diffraction techniques (See, e.g., Blundell & Johnson
(1985) Methods Enzymol., 114A & 115B, (Wyckoff et al., eds.), Academic
Press) and can be refined using computer software, such as the X-PLOR™
program (Brunger, (1992) X-PLOR, Version 3.1: A System for X-ray
Crystallography and NMR, Yale University Press, New Haven, Connecticut;
X-PLOR is available from Molecular Simulations, Inc., San Diego, California).
This information can thus be used to optimize known classes of PXR and
PXR LBD modulators, and more importantly, to design and synthesize novel
classes of PXR and PXR LBD modulators.
Laboratory Examples

The following Laboratory Examples have been included to illustrate
preferred modes of the invention. Certain aspects of the following Laboratory
Examples are described in terms of techniques and procedures found or
contemplated by the present inventors to work well in the practice of the
invention. These Laboratory Examples are exemplified through the use of
standard laboratory practices of the inventors. In light of the present
disclosure and the general level of skill in the art, those of skill will appreciate
that the following Laboratory Examples are intended to be exemplary only and
that numerous changes, modifications and alterations can be employed
without departing from the spirit and scope of the invention.

Laboratory Example 1 

PXR Expression and Purification 

The PXR LBD expression construct was engineered as an N-terminal
polyhistidine-tagged fusion protein with residues 130-434 from the human
PXR. The fusion insert (SEQ ID NO: 10) was subcloned into the pRSETA
expression vector (Invitrogen, Carlsbad, California). DNA encoding residues
623-710 of the human SRC-1 gene (Onate et al., (1995) Science 270: 1354-57)
(SEQ ID NO: 12) was subcloned into the bacterial vector pACYC184
(American Type Culture Collection #37033) along with a T7 promoter (Nolte et
al., (1998) Nature 395: 137-43). The hPXRLBD/pRSETA and the
SRC-1/pACYC184 plasmids were co-transformed into the BL21(DE3) strain of
E. coli. One-liter shake flask liquid cultures containing standard Luria-Bertani
(LB) broth with 0.05 mg/ml ampicillin and 0.05 mg/ml chloramphenicol were
inoculated and grown at 22°C for 20 hours. The cells were harvested by
centrifugation (20 minutes, 3500g, 4°C) and the cell pellet was stored at
-80°C. The cell pellet was resuspended in 250 ml of Buffer A (50 mM Tris-Cl
pH 7.8, 250 mM NaCl, 50 mM imidazole pH 7.5, 5% glycerol). Cells were
sonicated for 3-5 minutes on ice and the cell debris was removed by
centrifugation (45 minutes, 20,000g, 4°C). The cleared supernatant was
filtered through a 0.45 µm filter and loaded on to a 50 ml PROBOND™
nickel-chelating resin (Invitrogen, Carlsbad, California). After washing to
baseline with Buffer A, the column was washed with Buffer A containing 125 mM
imidazole pH 7.5. The PXR-LBD/SRC-1 complex was eluted from the column
using Buffer A with 500 mM imidazole pH 7.5. Column fractions were pooled
and concentrated using CENTRI-PREP™ 30K (Amicon/Millipore, Bedford,
Massachusetts) units. The protein was subjected to size exclusion
chromatography using a column (26 mm x 90 cm) packed with SEPHAROSE™ S-75
resin (Amersham Pharmacia Biotech, Piscataway, New Jersey) pre-equilibrated
with 20 mM Tris-Cl pH 7.8, 250 mM NaCl, 5 mM DTT, 2.5 mM EDTA pH 8.0,
5% glycerol.
Column fractions containing the PXR/SRC-1 complex were pooled and diluted
five-fold with the dilution buffer (20 mM Tris-Cl pH 7.8, 5 mM DTT, 2.5 mM
EDTA pH 8.0, 5% glycerol) and loaded on to a 50 ml Q SEPHAROSE™ FAST
FLOW™ column (Amersham Pharmacia Biotech, Piscataway, New Jersey).
The column was washed to baseline with 20 mM Tris-Cl pH 7.8, 50 mM NaCl,
5 mM DTT, 2.5 mM EDTA pH 8.0, 5% glycerol, and the protein complex was
eluted from the column using an increasing salt gradient. The appropriate
column fractions were pooled and stored on ice.
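
The five-fold dilution ahead of the anion-exchange step can be checked with simple arithmetic. Because the dilution buffer contains no NaCl, diluting the pooled size-exclusion fractions (250 mM NaCl) five-fold brings the salt to 50 mM, which matches the wash buffer, while components present at the same concentration in both buffers are unchanged. The following is a minimal sketch of that calculation only; the function name and the dictionary layout are illustrative assumptions and are not part of the described procedure, and glycerol (given as a percentage) is omitted.

```python
def dilute(sample, diluent, fold):
    """Concentrations after mixing 1 part sample with (fold - 1) parts diluent.

    `sample` and `diluent` map component names to concentrations in mM;
    a component missing from one buffer is treated as 0 mM there.
    """
    if fold < 1:
        raise ValueError("fold must be >= 1")
    components = set(sample) | set(diluent)
    return {
        name: (sample.get(name, 0.0) + (fold - 1) * diluent.get(name, 0.0)) / fold
        for name in components
    }


# Buffer compositions as stated in Laboratory Example 1 (mM).
size_exclusion_buffer = {"Tris-Cl": 20.0, "NaCl": 250.0, "DTT": 5.0, "EDTA": 2.5}
dilution_buffer = {"Tris-Cl": 20.0, "DTT": 5.0, "EDTA": 2.5}  # no NaCl

loaded = dilute(size_exclusion_buffer, dilution_buffer, fold=5)
for name in sorted(loaded):
    print(f"{name}: {loaded[name]:.1f} mM")
```

Under these assumptions the only component whose concentration changes is NaCl, which is the point of using a salt-free dilution buffer before binding to an anion-exchange resin.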



Laboratory Example 2 

Crystallization of a PXR Ligand Binding Domain

The human PXR ligand binding domain/SRC-1 complex (hPXR-LBD/SRC-1)
was concentrated in the presence of 10-fold molar excesses of
the SR12813 or I-SR12813 compounds to final concentrations of 4 and 5
mg/mL, respectively. The apo complex was concentrated to 5 mg/mL.
Crystallization of hPXR-LBD was achieved by hanging-drop vapor diffusion
against the following conditions at 22°C: SR12813: 50 mM imidazole at pH
7.2, 10% 2-propanol (v/v); I-SR12813: 50 mM imidazole at pH 7.2, 11%
2-propanol (v/v); apo: 50 mM imidazole at pH 7.4, 10% 2-propanol (v/v).
Laboratory Example 3

Structure Determination

The structures of an apo (unliganded) and SR12813-bound form of the
ligand binding domain of human PXR were determined by molecular
replacement using the crystal structure of the vitamin D receptor (VDR) as
the search model (SEQ ID NO: 7, structural coordinates presented in Table 5).
Non-identical side chains were trimmed prior to rotation and translation
function searches in AMoRe (Navaza & Saludjian, (1997) Methods Enzymol.
276A: 581-94); clear solutions were obtained for each in the proper
enantiomorphic space group, P4₃2₁2. The structures were refined using the
torsion angle protocol in CNS with the maximum likelihood function as a
target, and included an overall anisotropic B-factor and a bulk solvent
correction (Brunger et al., (1998) Acta Crystallog. D 54: 905-21).

For both structures, 10% of the observed data were set aside for
cross-validation using the free-R statistic prior to any structural refinement
(Brunger, (1993) Acta Crystallog. D 49: 24-36). Manual adjustments and
rebuilding of the model were performed using O (Jones et al., (1991) Acta
Crystallog. A 47: 110-19) and σA-weighted electron density maps (Read, (1986)
Acta Crystallog. A 42: 140-49). At the later stages of refinement, 281 and 166
solvent sites were added to the apo and SR12813 complexes,
respectively. Structures exhibit good geometry with no Ramachandran
outliers, as shown in Table 1. Molecular graphics figures were created with
MOLSCRIPT (Kraulis, (1991) J. Applied Crystallogr. 24: 946-50), RASTER-3D
(Merritt & Bacon, (1997) Methods Enzymol. 277: 505-24), GRASP (Nicholls et
al., (1991) Proteins 11: 281-96) and CS CHEMDRAW PRO™, available from
CambridgeSoft of Cambridge, Massachusetts.

Laboratory Example 4 
Positioning of SR12813 Ligands 
Clear electron density was evident in the ligand binding cavity of PXR
at the earliest stages of refinement. The positioning of single SR12813
molecules into this electron density, however, resulted in significant positive
electron density peaks elsewhere in the cavity, suggesting that more than one
conformation of the ligand was present. This was confirmed using difference
density generated from crystals of hPXR-LBD containing an iodinated form of
SR12813 (I-SR12813). The two t-butyl groups of SR12813 were replaced
with iodine in I-SR12813; this ligand was found to activate PXR as well as the
parent SR12813 compound. Two difference density maps were used to guide
the positioning of SR12813 ligands:
$[(|F_{obs}^{\text{I-SR12813}}| - |F_{obs}^{\text{SR12813}}|),\ \phi_{calc}^{\text{PROTEIN}}]$ and
$[(|F_{obs}^{\text{I-SR12813}}| - |F_{obs}^{\text{PROTEIN}}|),\ \phi_{calc}^{\text{PROTEIN}}]$.

Each map was calculated to 3.0 Å resolution and both showed a
consistent set of significant positive difference density peaks in the ligand
binding cavity, which were interpreted to be the positions of the iodine atoms
on I-SR12813. These peaks were the highest difference density peaks in both
maps. It was found that three SR12813 ligands optimally satisfied the electron
density in the ligand binding cavity. The first SR12813 (position 1) was placed
in part based on two 7σ difference density peaks at appropriate positions in
both maps. The second SR12813 (position 3) was positioned using two 5σ
difference density peaks at appropriate positions. The third SR12813
(position 2) was placed based on 4σ iodine difference density peaks and
residual difference density in the maps after the refinement of positions 1 and
3 together.

Standard simulated density maps,
$[(|F_{obs}^{\text{PROTEIN}}| - |F_{calc}^{\text{PROTEIN}}|),\ \phi_{calc}^{\text{PROTEIN}}]$,
at 2.75 Å resolution were used to guide the positioning of the
remainder of each SR12813 ligand and to assess the quality of the fits to the
density (Brunger et al., (1990) Acta Crystallog. A 46: 585-93). No positive or
negative difference density peaks above 3σ were present in the binding cavity
after the refinement of the three ligand positions together. Occupancies of
each ligand position were estimated based on iodine difference density and
standard electron density peaks, and were fixed at 0.5, 0.2 and 0.3 for
positions 1, 2 and 3, respectively. In addition, the R and R-free after the
positioning of the three ligand positions were improved over each
conformation alone, or over pairs of each conformation.
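
Difference maps of the kind described in this example combine, for each reflection, the difference between two sets of observed amplitudes with a phase calculated from the protein model. The sketch below illustrates only that coefficient calculation, under stated assumptions: the function name, the use of dictionaries keyed by Miller index, and the toy amplitudes and phases are hypothetical, and no scaling between data sets, weighting, or Fourier synthesis of the map itself is shown.

```python
import math


def difference_coefficients(f_obs_a, f_obs_b, phi_calc_deg):
    """Return complex coefficients (|F_a| - |F_b|) * exp(i * phi_calc).

    Each argument maps a Miller index (h, k, l) to an amplitude (f_obs_a,
    f_obs_b) or to a calculated phase in degrees (phi_calc_deg). Only
    reflections present in all three inputs are used.
    """
    common = set(f_obs_a) & set(f_obs_b) & set(phi_calc_deg)
    coefficients = {}
    for hkl in common:
        delta = abs(f_obs_a[hkl]) - abs(f_obs_b[hkl])
        phase = math.radians(phi_calc_deg[hkl])
        coefficients[hkl] = complex(delta * math.cos(phase), delta * math.sin(phase))
    return coefficients


# Toy, made-up values for illustration only (not data from the disclosure).
f_iodinated = {(1, 0, 0): 120.0, (0, 1, 1): 80.0, (2, 1, 0): 45.0}
f_parent = {(1, 0, 0): 100.0, (0, 1, 1): 85.0}
phi_model = {(1, 0, 0): 90.0, (0, 1, 1): 0.0, (2, 1, 0): 180.0}

coeffs = difference_coefficients(f_iodinated, f_parent, phi_model)
for hkl in sorted(coeffs):
    print(hkl, coeffs[hkl])
```

In this toy input the reflection (2, 1, 0) is skipped because it was not measured in both data sets. A positive amplitude difference contributes positive density at the model phase, which is why iodine sites appear as positive peaks in such maps.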
Laboratory Example 5

Mutagenesis

The human and mouse PXR mutants were generated using the
QUIKCHANGE™ mutagenesis kit (Stratagene, La Jolla, California) according
to the manufacturer's instructions. All mutants were confirmed by sequence
analysis.

Laboratory Example 6 

Cotransfection Assays 
Transient cotransfection experiments were performed using CV-1 cells
and the XREM-CYP3A4-LUC reporter, containing the enhancer and promoter
of CYP3A4 driving luciferase expression, as described in Moore et al., (2000)
J. Biol. Chem. 275: 15122-27.
TABLE 1

CRYSTALLOGRAPHIC DATA AND REFINEMENT

                                  APO                 SR12813              I-SR12813¶
Resolution (Å; highest shell)     20-2.5 (2.54-2.5)   20-2.75 (2.8-2.75)   20-3.0 (3.05-3.0)
Space Group                       P4₃2₁2              P4₃2₁2               P4₃2₁2
Cell Constants (Å)                a=91.6; c=85.0      a=91.3; c=85.3       a=91.1; c=84.5
Total Reflections                 179,671             133,315              91,752
Unique Reflections                12,611              9,512                7,269
Mean Redundancy                   14.2                14                   12.6
Rsym* (%; highest shell)          7.3 (39.0)          9.8 (40.0)           9.7 (51.0)
Completeness (%; highest shell)   99.9 (100)          99.6 (100)           99.9 (100)
Mean I/σ (highest shell)          40.8 (7.9)          25.9 (3.7)           31.4 (5.4)
Rcryst† (%; highest shell)        20.9 (25.9)         21.3 (26.1)
Rfree‡ (%; highest shell)         28.4 (32.1)         27.4 (40.5)
RMSD§ Bond Lengths (Å)            0.0059              0.0093
RMSD§ Bond Angles (°)             1.11                1.29
RMSD§ Dihedrals (°)               20.54               21.33
RMSD§ Impropers (°)               0.71                0.91
Number Protein Atoms              2141                2141
Number Solvent Sites              281                 166
Number Ligand Atoms                                   99

¶ Data from crystals containing I-SR12813 were used only to guide and confirm the
positioning of SR12813 ligands; thus, a refined structure was not generated.

* $R_{sym} = \sum |I - \langle I \rangle| / \sum I$, where $I$ is the observed intensity and $\langle I \rangle$ is the average intensity of
multiple symmetry-related observations of that reflection.

† $R_{cryst} = \sum ||F_{obs}| - |F_{calc}|| / \sum |F_{obs}|$, where $F_{obs}$ and $F_{calc}$ are the observed and calculated structure
factors, respectively.

‡ $R_{free} = \sum ||F_{obs}| - |F_{calc}|| / \sum |F_{obs}|$ for 10% of the data not used at any stage of structural
refinement.

§ RMSD, root mean square deviation.
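
The R-factor definitions in the footnotes to Table 1 translate directly into a short calculation. The sketch below is illustrative only: the function names, the random selection of the held-out set, and the toy amplitudes are assumptions made for the example and do not reproduce how CNS selects or uses the cross-validation reflections.

```python
import random


def r_factor(f_obs, f_calc):
    """R = sum(| |F_obs| - |F_calc| |) / sum(|F_obs|) over paired reflections."""
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    return numerator / denominator


def split_free_set(n_reflections, fraction=0.10, seed=0):
    """Return (work_indices, free_indices), holding out `fraction` for R-free."""
    rng = random.Random(seed)
    indices = list(range(n_reflections))
    rng.shuffle(indices)
    n_free = round(n_reflections * fraction)
    return sorted(indices[n_free:]), sorted(indices[:n_free])


# Toy amplitudes, made up for illustration (not data from Table 1).
f_obs = [100.0, 50.0, 25.0, 25.0]
f_calc = [90.0, 55.0, 30.0, 25.0]
print(f"R = {r_factor(f_obs, f_calc):.3f}")

# Size of a 10% hold-out for the 12,611 unique apo reflections in Table 1.
work, free = split_free_set(12611)
print(len(work), len(free))
```

R-cryst is this quantity evaluated over the reflections used in refinement, and R-free is the same quantity evaluated over the held-out reflections, which is why R-free is the higher of the two values in Table 1.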
TABLE 2

COMPARISON OF HUMAN PXR-LBD WITH NUCLEAR RECEPTOR LBDs
OF KNOWN STRUCTURE

                        Number Residues   Sequence       RMSD (Å)   Ligand Binding
                        Superimposed      Identity (%)              Cavity Volume* (Å³)
PXR                                                                 1,150
Vitamin D Receptor      225               45             1.8        871
RXRα                    178               22             2.8        687
PPARγ                   203               25             2.6        1,619
Progesterone Receptor   201               19             2.9        557
Estrogen Receptor       180               18             2.8        476

* The cavity volumes were calculated with GRASP (Nicholls et al., (1991) Proteins 11: 281-96)
using a grid spacing of 0.2 Å, probe radius of 1.4 Å and the atomic radii of Bondi (H=1.2
Å, C=1.7 Å, N=1.55 Å, O=1.52 Å, S=1.8 Å) (Bondi, (1964) J. Phys. Chem. 68: 441-51), where
hydrogens were treated explicitly. Openings in the PXR, PPARγ and RXRα pockets were
closed by covering these proteins with an external layer of water molecules prior to
calculation of the molecular surface and cavities.
TABLE 3

CALCULATED BURIED SOLVENT-ACCESSIBLE SURFACE AREA OF
AMINO ACID RESIDUES THAT LINE THE LIGAND BINDING CAVITY OF
hPXR

Residue   Buried Surface   Contacts SR12813   Human   Rabbit   Rat   Mouse
          (Å²)             Ligand?
206        9.8             Yes                L       L        R     S
208       15.8             Yes                S       T        P     P
209       30.5             Yes                L       M        M     M
211       16.3             Yes                V       L        I     I
240       14.8             Yes                L       L        L     L
243       43.6             Yes                M       L        L     L
244        1.9             No                 A       A        A     A
246        6.1             Yes                M       M        V     V
247       30.4             Yes                S       S        S     S
251       11.2             Yes                F       F        F     F
281       29.3             Yes                F       L        F     F
284        9.6             Yes                C       C        C     C
285       23.0             Yes                Q       L        I     I
288       29.6             Yes                F       F        F     F
299       28.6             Yes                W       W        W     W
306       12.9             No                 Y       Y        Y     Y
308        3.7             No                 L       V        F     F
321        4.2             No                 E       D        D     D
323       20.3             Yes                M       L        L     L
324       12.0             Yes                L       L        M     M
327        2.5             No                 H       H        H     H
407       41.6             Yes                H       H        Q     Q
410       14.7             Yes                R       R        Q     Q
411       33.1             No                 L       L        L     L
414       11.5             No                 I       I        I     I
420        9.3             Yes                F       F        F     F
425        6.2             No                 M       M        M     M
429        3.6             No                 F       F        F     F

Boldface indicates identity; italics indicate similarity. The buried surface area upon ligand
binding was calculated using CNS (Brunger et al., (1998) Acta Crystallogr. D 54: 905-21).
The four residues underlined were mutated to examine the ligand specificity of mouse and
human PXR.
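
Table 3 lends itself to a simple comparison of the cavity-lining residues across species. The sketch below lists the positions at which the human and mouse residues differ, optionally restricted to positions that contact the SR12813 ligand. It is an illustration under stated assumptions: the row data are transcribed by hand from Table 3 (with the character "1" in the scanned table read as isoleucine, I), the function name is hypothetical, and the output does not identify which four residues were selected for mutagenesis.

```python
# (residue, buried surface in Å^2, contacts SR12813?, human, rabbit, rat, mouse)
# Transcribed from Table 3; the transcription is an assumption of this sketch.
TABLE_3 = [
    (206, 9.8, True, "L", "L", "R", "S"), (208, 15.8, True, "S", "T", "P", "P"),
    (209, 30.5, True, "L", "M", "M", "M"), (211, 16.3, True, "V", "L", "I", "I"),
    (240, 14.8, True, "L", "L", "L", "L"), (243, 43.6, True, "M", "L", "L", "L"),
    (244, 1.9, False, "A", "A", "A", "A"), (246, 6.1, True, "M", "M", "V", "V"),
    (247, 30.4, True, "S", "S", "S", "S"), (251, 11.2, True, "F", "F", "F", "F"),
    (281, 29.3, True, "F", "L", "F", "F"), (284, 9.6, True, "C", "C", "C", "C"),
    (285, 23.0, True, "Q", "L", "I", "I"), (288, 29.6, True, "F", "F", "F", "F"),
    (299, 28.6, True, "W", "W", "W", "W"), (306, 12.9, False, "Y", "Y", "Y", "Y"),
    (308, 3.7, False, "L", "V", "F", "F"), (321, 4.2, False, "E", "D", "D", "D"),
    (323, 20.3, True, "M", "L", "L", "L"), (324, 12.0, True, "L", "L", "M", "M"),
    (327, 2.5, False, "H", "H", "H", "H"), (407, 41.6, True, "H", "H", "Q", "Q"),
    (410, 14.7, True, "R", "R", "Q", "Q"), (411, 33.1, False, "L", "L", "L", "L"),
    (414, 11.5, False, "I", "I", "I", "I"), (420, 9.3, True, "F", "F", "F", "F"),
    (425, 6.2, False, "M", "M", "M", "M"), (429, 3.6, False, "F", "F", "F", "F"),
]


def human_mouse_differences(rows, contacts_only=False):
    """Residue numbers where the human and mouse residues differ."""
    return [
        residue
        for residue, _area, contacts, human, _rabbit, _rat, mouse in rows
        if human != mouse and (contacts or not contacts_only)
    ]


all_diffs = human_mouse_differences(TABLE_3)
contact_diffs = human_mouse_differences(TABLE_3, contacts_only=True)
print("Human vs mouse differences:", all_diffs)
print("...of which contact SR12813:", contact_diffs)
```

Filtering on the contact column narrows the candidates to positions that both vary between species and touch the ligand, which is one plausible way to shortlist residues for a species-specificity mutagenesis study.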
TABLE 4

ATOMIC COORDINATE DATA FOR CRYSTALLIZED HUMAN PXR

ATOM  ATOM TYPE  RESIDUE         X        Y        Z     OCC     B


A 
1 


O 
0 


Ol V 

uLY 


*. A 9 
14^ 


^ ftOft 

-o.ouo 


AA 7*vl 


1? SR1 


1 .00 


58.97 


2 


V 


Ol V 


A AO 
1 4ii 


^ 701 


AC. COO 


14 S1 5 


1.00 


59.54 


6 


M 


Ol V 
OUT 


\ AO 
14Z 


A **77 
-4.0/ f 


A*\ 177 
HO. I » f 


14 R49 


1 .00 


59.37 


A 

4 


OA 


Ol V 

GLY 


■1 A O 
14^ 


-0.0U/ 






1 00 

1 .UU 


59.68 


5 


In 


LbU 


A A *3 

140 


-O.0Z4 


/1C 4 no 
H-O. 1 VJO 


1? ^87 


1 .00 


58.87 


6 


CA 


LbU 


^ a i 
140 








1 00 


58.50 


7 


CB 


1 CI 1 

LbU 


A A 1 

140 


-D.400 


/ie QA7 

4o.yu / 


in 7^r 


1 00 

1 .VJVJ 


57.90 


8 


CG 


LEU 


A A O 

14*5 


-o.t54y 


aq OQ(\ 
4-O.^vJU 


in 


1 00 


57.77 


9 


CD1 


1 CI t 

LbU 


140 


-4.oyy 


4ft A1 1 
*t0.4- I I 


1 1 4D7 


1 00 

1 . UU 


56.51 


10 


. CD2 


1 CM 1 

LbU 


140 


-O.OUO 


Aft AQ9 
4o.4a^ 


q nan 


1 00 


56.92 


1 1 


C 


LbU 


*1 A 1 
140 


ft 1^0 


Aft AAR 




1 .00 


58.92 


12 


0 


1 CI 1 

LEU 


140 


-y.U4o 


At; RAn 

40.D4U 




1 00 

1 .UU 


59.85 


13 


N 


. THR 


AAA 
144 


Q QCO 


A7 *3A1 


1^ 17A 


1 00 

I .UU 


58.88 


14 


CA 


THR 


AAA 

144 


-1 o.^yy 


A 7 A CV7 


*t q AAA 
1 0.444 


1 00 


76 

UC7 . t U 


15 


CB 


Tun 

THR 


AAA 
144 


-i u.oy4 


Aft 1 ^A 


1 A 770 


1 00 

1 .Uu 


58.88 


16 


OG1 


THK 


AAA 
144 


A f\ A A 7 

-1 U.44 1 


AQ f^AQ 
4-y .04o 


14 ^RR 

I I.OOJ 


1 .00 


56.11 


17 


CG2 


THR 


AAA 

144 


-y.b44 


y17 RRH 


1 ^ 8^7 


1 00 


58 12 


18 


C 


THR 


AAA 

144 


AAA 07 
-1 1 


/ift -tn^ 

4C5. 1 UO 


19 


1 00 

1 .uVJ 


61.28 


19 


O 


THR 


AAA 

144 


-1 U.OU4 


/I ft Q^l 


I I.O IO 


1 00 


62.11 


20 


N 


GLU 


145 


An AOC 

-12.400 


A 7 Q7K 


1 9 A1 Q 

1*1.4 iy 


1 00 

I .VJVJ 


62 26 


21 


CA 


GLU 


A A C 

14b 


-I O.OoU 


Aft /1ft9 


1 1 477 

1 l .4 * ( 


1 00 


62.32 


22 


CB 


GLU 


A A C 

145 


-14. /yy 


/i q *t qn 

4C5.10U 


1 1 87n 


1 00 


63.47 


23 


CG 


GLU 


145 


-1 O.ooO 


Aft ftA9 


1 1 nfifi 

I I .uuu 


1.00 


67.03 


24 


CD 


GLU 


A A C 
145 


-1 0. /DO 


/1ft RA9 




1 .00 


68.86 


25 


/-X 1 — A 

OE1 


Ol 1 1 

GLU 


140 


i A QKO 


A7 7R0 


Q 19Q 


1.00 


70.24 


26 


OE2 


GLU 


145 


-i o.4yz 


AQ "399 


R ROQ 


1 .00 


68.98 


27 


C 


O 1 1 1 

GLU 


145 


HO-* CO 

-1 0.100 


AQ QQA 


1 1 .476 


1 .00 


61.32 


28 


U 


Ol 1 1 


140 


i 9 QQ7 
- ! z.yy r 




10.424 


1 .00 


60.90 


29 


M 

N 


Ol 1 1 

GLU 


4 4R 
|4D 


1 ^ i RQ 

- 1 o, i oy 


en c;QA 


12.662 


1.00 


60.97 


30 


o a 
OA 


Ol 1 1 

LjLU 


H AR 
14D 


•1 9 Qftft 

- i/i.yoo 


^9 n^7 


12.793 


1.00 


61.06 


31 


on 


Ol 1 1 

bLU 


*1 AR 
140 


1 ^ 09R 




14.269 


1.00 


64.21 


O 1 

oz 


oo 


Ol ! | 

bLU 


1 Afi 
1 ~ru 


-19 RR7 


51.378 


15.254 


1.00 


68.91 


OO 




Ol 1 J 


1 *-r\J 


-1^ 6^8 


50.326 


15.526 


1.00 


71.06 


  34  OE1  GLU  146  -14.703  50.686   16.079  1.00  71.24
  35  OE2  GLU  146  -13.415  49.142   15.184  1.00  72.11
  36  C    GLU  146  -11.688  52.514   12.161  1.00  59.14
  37  O    GLU  146  -11.652  53.556   11.504  1.00  59.31
  38  N    GLN  147  -10.620  51.754   12.371  1.00  57.26
  39  CA   GLN  147   -9.325  52.095   11.802  1.00  54.84
  40  CB   GLN  147   -8.248  51.126   12.309  1.00  54.51
  41  CG   GLN  147   -7.812  51.390   13.755  1.00  53.99
  42  CD   GLN  147   -6.957  50.275   14.353  1.00  54.37
  43  OE1  GLN  147   -6.291  50.474   15.368  1.00  55.65
  44  NE2  GLN  147   -6.984  49.096   13.737  1.00  54.19
  45  C    GLN  147   -9.430  52.034   10.283  1.00  53.65
  46  O    GLN  147   -8.948  52.921    9.581  1.00  52.79
  47  N    ARG  148  -10.083  50.991    9.783  1.00  52.52
  48  CA   ARG  148  -10.259  50.819    8.349  1.00  51.82
  49  CB   ARG  148  -11.020  49.528    8.061  1.00  52.78
  50  CG   ARG  148  -10.368  48.278    8.626  1.00  55.07
  51  CD   ARG  148  -11.157  47.046    8.233  1.00  56.61
  52  NE   ARG  148  -11.103  46.818    6.792  1.00  59.42
  53  CZ   ARG  148  -11.976  46.074    6.116  1.00  61.30
  54  NH1  ARG  148  -12.982  45.485    6.752  1.00  62.10
  55  NH2  ARG  148  -11.839  45.912    4.805  1.00  62.11
  56  C    ARG  148  -11.010  51.991    7.732  1.00  50.99
  57  O    ARG  148  -10.585  52.545    6.720  1.00  52.39
  58  N    MET  149  -12.129  52.369    8.336  1.00  50.18
  59  CA   MET  149  -12.920  53.472    7.813  1.00  50.13
  60  CB   MET  149  -14.194  53.665    8.643  1.00  53.54
  61  CG   MET  149  -15.135  52.461    8.644  1.00  57.84
  62  SD   MET  149  -16.690  52.746    9.560  1.00  64.69
  63  CE   MET  149  -16.093  52.876   11.284  1.00  59.97
  64  C    MET  149  -12.105  54.755    7.821  1.00  48.09
  65  O    MET  149  -12.210  55.576    6.912  1.00  48.07
  66  N    MET  150  -11.285  54.918    8.850  1.00  45.74
  67  CA   MET  150  -10.458  56.107    8.990  1.00  43.72
  68  CB   MET  150   -9.721  56.049   10.318  1.00  43.70
  69  CG   MET  150   -9.356  57.394   10.877  1.00  45.87
  70  SD   MET  150   -8.499  57.190   12.442  1.00  49.99
  71  CE   MET  150   -9.883  56.803   13.553  1.00  48.07
  72  C    MET  150   -9.453  56.229    7.845  1.00  42.57
  73  O    MET  150   -9.258  57.307    7.275  1.00  41.08
  74  N    ILE  151   -8.822  55.111    7.507  1.00  41.43
  75  CA   ILE  151   -7.837  55.087    6.439  1.00  40.06
  76  CB   ILE  151   -7.118  53.726    6.387  1.00  39.91
  77  CG2  ILE  151   -6.064  53.727    5.273  1.00  39.50
  78  CG1  ILE  151   -6.449  53.459    7.742  1.00  38.84
  79  CD1  ILE  151   -5.681  52.170    7.818  1.00  37.16
  80  C    ILE  151   -8.503  55.377    5.105  1.00  39.71
  81  O    ILE  151   -8.014  56.190    4.311  1.00  39.37
  82  N    ARG  152   -9.627  54.717    4.864  1.00  39.09
  83  CA   ARG  152  -10.363  54.920    3.627  1.00  37.68
  84  CB   ARG  152  -11.571  53.980    3.584  1.00  40.07
  85  CG   ARG  152  -12.593  54.281    2.493  1.00  45.57
  86  CD   ARG  152  -11.962  54.439    1.111  1.00  48.14
  87  NE   ARG  152  -12.786  53.812    0.080  1.00  50.92
  88  CZ   ARG  152  -12.837  52.500   -0.134  1.00  50.95
  89  NH1  ARG  152  -12.106  51.678    0.608  1.00  50.25
  90  NH2  ARG  152  -13.623  52.006   -1.082  1.00  50.40
  91  C    ARG  152  -10.807  56.373    3.491  1.00  35.71
  92  O    ARG  152  -10.785  56.926    2.399  1.00  34.62
  93  N    GLU  153  -11.203  56.996    4.595  1.00  35.31
  94  CA   GLU  153  -11.641  58.384    4.537  1.00  36.30
  95  CB   GLU  153  -12.264  58.822    5.864  1.00  39.33
  96  CG   GLU  153  -12.848  60.228    5.808  1.00  45.70
  97  CD   GLU  153  -13.770  60.561    6.982  1.00  52.48
  98  OE1  GLU  153  -14.304  61.695    6.992  1.00  54.21
  99  OE2  GLU  153  -13.971  59.708    7.888  1.00  53.74
 100  C    GLU  153  -10.459  59.278    4.202  1.00  35.22
 101  O    GLU  153  -10.594  60.257    3.460  1.00  33.48
 102  N    LEU  154   -9.297  58.938    4.753  1.00  34.10
 103  CA   LEU  154   -8.089  59.706    4.486  1.00  32.93
 104  CB   LEU  154   -6.973  59.284    5.436  1.00  32.59
 105  CG   LEU  154   -7.114  59.676    6.903  1.00  32.45
 106  CD1  LEU  154   -5.829  59.293    7.648  1.00  28.94
 107  CD2  LEU  154   -7.391  61.180    7.007  1.00  30.74
 108  C    LEU  154   -7.648  59.486    3.040  1.00  32.49
 109  O    LEU  154   -7.273  60.438    2.342  1.00  31.27
 110  N    MET  155   -7.689  58.227    2.596  1.00  31.58
 111  CA   MET  155   -7.307  57.877    1.227  1.00  31.69
 112  CB   MET  155   -7.271  56.361    1.046  1.00  28.61
 113  CG   MET  155   -6.056  55.708    1.673  1.00  29.23
 114  SD   MET  155   -4.498  56.482    1.116  1.00  28.89
 115  CE   MET  155   -4.096  55.468   -0.360  1.00  30.07
 116  C    MET  155   -8.256  58.485    0.200  1.00  32.29
 117  O    MET  155   -7.836  58.922   -0.871  1.00  32.41
 118  N    ASP  156   -9.539  58.518    0.535  1.00  32.13
 119  CA   ASP  156  -10.530  59.078   -0.367  1.00  32.52
 120  CB   ASP  156  -11.938  58.791    0.172  1.00  35.14
 121  CG   ASP  156  -13.033  59.420   -0.671  1.00  36.51
 122  OD1  ASP  156  -13.515  60.517   -0.303  1.00  35.94
 123  OD2  ASP  156  -13.404  58.814   -1.702  1.00  39.04
 124  C    ASP  156  -10.287  60.575   -0.496  1.00  31.62
 125  O    ASP  156  -10.373  61.140   -1.586  1.00  30.81
 126  N    ALA  157   -9.960  61.213    0.621  1.00  31.28
 127  CA   ALA  157   -9.700  62.646    0.628  1.00  30.89
 128  CB   ALA  157   -9.547  63.128    2.050  1.00  29.70
 129  C    ALA  157   -8.448  62.977   -0.182  1.00  32.21
 130  O    ALA  157   -8.377  64.029   -0.815  1.00  31.78
 131  N    GLN  158   -7.469  62.070   -0.155  1.00  32.47
 132  CA   GLN  158   -6.216  62.245   -0.880  1.00  33.24
 133  CB   GLN  158   -5.214  61.153   -0.497  1.00  35.65
 134  CG   GLN  158   -4.160  61.542    0.535  1.00  37.85
 135  CD   GLN  158   -3.119  62.520    0.020  1.00  34.82
 136  OE1  GLN  158   -3.399  63.703   -0.168  1.00  35.15
 137  NE2  GLN  158   -1.907  62.028   -0.203  1.00  34.05
 138  C    GLN  158   -6.380  62.215   -2.392  1.00  34.09
 139  O    GLN  158   -5.916  63.114   -3.085  1.00  34.76
 140  N    MET  159   -7.019  61.173   -2.913  1.00  34.17
 141  CA   MET  159   -7.176  61.089   -4.352  1.00  34.32
 142  CB   MET  159   -7.639  59.686   -4.764  1.00  35.03
 143  CG   MET  159   -9.088  59.365   -4.524  1.00  37.89
 144  SD   MET  159  -10.089  59.609   -6.008  1.00  40.60
 145  CE   MET  159  -11.625  60.101   -5.262  1.00  36.87
 146  C    MET  159   -8.114  62.167   -4.885  1.00  33.55


 147  O    MET  159
                                               1.00  83.99
 544  CB   LYS  234   21.492  92.884   -7.075  1.00  83.43
 545  C    LYS  234   20.322  90.676   -7.236  1.00  82.47
 546  O    LYS  234   20.864  90.243   -8.251  1.00  83.25
 547  N    GLU  235   19.090  90.340   -6.871  1.00  80.37
 548  CA   GLU  235   18.244  89.461   -7.676  1.00  78.49
 549  CB   GLU  235   18.052  90.061   -9.072  1.00  79.29
 550  C    GLU  235   18.669  87.998   -7.814  1.00  77.17
 551  O    GLU  235   19.144  87.562   -8.866  1.00  76.76
 552  N    ILE  236   18.490  87.248   -6.736  1.00  75.47
 553  CA   ILE  236   18.781  85.819   -6.710  1.00  72.63
 554  CB   ILE  236   19.778  85.461   -5.586  1.00  73.98
 555  CG2  ILE  236   21.206  85.691   -6.065  1.00  74.10
 556  CG1  ILE  236   19.458  86.280   -4.328  1.00  74.81
 557  CD1  ILE  236   20.463  86.106   -3.189  1.00  75.72
 558  C    ILE  236   17.414  85.228   -6.399  1.00  69.92
 559  O    ILE  236   17.280  84.069   -5.997  1.00  69.58
 560  N    PHE  237   16.406  86.077   -6.587  1.00  66.10
 561  CA   PHE  237   15.010  85.739   -6.356  1.00  61.51
 562  CB   PHE  237   14.291  86.898   -5.667  1.00  62.12
 563  CG   PHE  237   15.115  87.589   -4.628  1.00  60.86
 564  CD1  PHE  237   15.850  88.723   -4.951  1.00  59.98
 565  CD2  PHE  237   15.177  87.089   -3.331  1.00  59.56
 566  CE1  PHE  237   16.637  89.348   -3.997  1.00  60.01
 567  CE2  PHE  237   15.957  87.702   -2.373  1.00  59.37
 568  CZ   PHE  237   16.691  88.836   -2.704  1.00  60.00
 569  C    PHE  237   14.348  85.491   -7.696  1.00  58.40
 570  O    PHE  237   13.124  85.511   -7.812  1.00  58.37
 571  N    SER  238   15.169  85.279   -8.716  1.00  54.37
 572  CA   SER  238   14.657  85.028  -10.049  1.00  49.97
 573  CB   SER  238   15.824  84.698  -10.990  1.00  50.06
 574  OG   SER  238   16.776  83.867  -10.349  1.00  50.94
 575  C    SER  238   13.627  83.896  -10.042  1.00  46.05
 576  O    SER  238   12.686  83.896  -10.837  1.00  44.82
 577  N    LEU  239   13.782  82.956   -9.115  1.00  41.74
 578  CA   LEU  239   12.877  81.819   -9.037  1.00  37.63
 579  CB   LEU  239   13.703  80.554   -8.827  1.00  38.30
 580  CG   LEU  239   13.323  79.289   -9.603  1.00  40.67
 581  CD1  LEU  239   12.910  79.620  -11.030  1.00  40.00
 582  CD2  LEU  239   14.520  78.346   -9.605  1.00  41.71
 583  C    LEU  239   11.759  81.893   -7.986  1.00  35.73
 584  O    LEU  239   10.905  81.017   -7.949  1.00  35.63
 585  N    LEU  240   11.743  82.929   -7.150  1.00  33.56
 586  CA   LEU  240   10.713  83.057   -6.110  1.00  31.58
 587  CB   LEU  240   10.948  84.294   -5.229  1.00  32.56
 588  CG   LEU  240   11.966  84.203   -4.095  1.00  32.81
 589  CD1  LEU  240   11.841  85.443   -3.232  1.00  34.47
 590  CD2  LEU  240   11.715  82.961   -3.262  1.00  33.77
 591  C    LEU  240    9.280  83.117   -6.594  1.00  29.50
 592  O    LEU  240    8.439  82.349   -6.144  1.00  30.03
 593  N    PRO  241    8.970  84.052   -7.500  1.00  28.88
 594  CD   PRO  241    9.866  84.966   -8.230  1.00  27.35
 595  CA   PRO  241    7.593  84.154   -7.993  1.00  29.16
 596  CB   PRO  241    7.712  85.125   -9.166  1.00  27.20
 597  CG   PRO  241    8.902  85.951   -8.813  1.00  26.11
 598  C    PRO  241    7.024  82.800   -8.417  1.00  31.71
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599 


0 


PRO 


241 


5.997 


82.360 


-7.892 


1.00 


32.87 


600 


N 


HIS 


242 


7.690 


82.153 


-9.374 


1.00 


32.44 


601 


CA 


HIS 


242 


7.273 


80.840 


-9.869 


1.00 


33.09 


602 


CB 


HIS 


242 


8.376 


80.183 


-10.696 


1.00 


33.12 


603 


CG 


*HIS 


242 


8.151 


78.722 


-10.931 


1.00 


31.81 


604 


CD2 


HIS 


242 


8.575 


77.633 


-10.246 


1.00 


33.86 


605 


ND1 


HIS 


242 


7.315 


78.251 


-11.916 


1.00 


33.72 


606 


CE1 


HIS 


242 


7.228 


76.935 


-11.828 


1.00 


34.99 


607 


NE2 


HIS 


242 


7.982 


76.535 


-10.821 


1.00 


34.87 


608 


C 


HIS 


242 


7.010 


79.923 


-8.705 


1.00 


32.78 


609 


0 


HIS 


242 


5.955 


79.300 


-8.603 


1.00 


33.32 


610 


N 


MET 


243 


8.027 


79.830 


-7.859 


1.00 


34.86 


611 


CA 


MET 


243 


8.016 


78.989 


-6.681 


1.00 


37.25 


612 


CB 


MET 


243 


9.270 


79.237 


.-5.843 


1.00 


41.17 


613 


CG 


MET 


243 


10.541 


78.739 


-6.445 


1.00 


48.42 


614 


SD 


MET 


243 


10.419 


76.969 


-6.614 


1.00 


56.97 


615 


CE 


MET 


243 


11.078 


76.678 


-8.280 


1.00 


58.78 


616 


C i 


MET 


243 


6.785 


79.266 


-5.866 


1.00 


36.91 


617 


O 


MET 


243 


6.110 


78.336 


-5.445 


1.00 


36.25 


618 


N 


ALA 


244 


6.505 


80.541 


-5.615 


1.00 


36.65 


619 


CA 


ALA 


244 


5.316 


80.924 


-4.846 


1.00 


37.41 


620 


CB 


ALA 


244 


5.249 


82.457 


-4.683 


1.00 


37.71 


621 


C 


ALA 


244 


4.017 


80.424 


-5.490 


1.00 


36.31 


622 


0 


ALA 


244 


3.086 


80.012 


-4.799 


1.00 


37.79 


623 


N 


ASP 


245 


3.940 


80.475 


-6.814 


1.00 


35.19 


624 


CA 


ASP 


245 


2.742 


80.016 


-7.497 


1.00 


34.32 


625 


CB 


ASP 


245 


2.810 


80.355 


-8.985 


1.00 


31.56 


626 


CG 


ASP 


245 


2.738 


81.849 


-9.235 


1.00 


30.74 


627 


OD1 


ASP 


245 


2.014 


82.528 


-8.482 


1.00 


28.28 


628 


OD2 


ASP 


245 


3.389 


82.346 


-10.180 


1.00 


30.84 


629 


C 


ASP 


245 


2.529 


78.525 


-7.295 


1.00 


34.63 


630 


O 


ASP 


245 


1.390 


78.085 


-7.105 


1.00 


36.05 


631 


N 


MET 


246 


3.619 


77.760 


-7.323 


1.00 


34.62 


632 


CA 


MET 


246 


3.571 


76.306 


-7.126 


1.00 


35.13 


633 


CB 


MET 


246 


4.946 


75.686 


-7.378 


1.00 


35.05 


634 


CG 


MET 


246 


5.028 


74.211 


-7.042 


1.00 


35.60 


635 


SD 


MET 


246 


4.127 


73.170 


-8.205 


1.00 


36.53 


636 


CE 


MET 


246 


2.599 


73.067 


-7.434 


1.00 


37.78 


637 


C 


MET 


246 


3.139 


75.989 


-5.697


1.00 


34.29 


638 


O 


MET 


246 


2.446 


75.006 


-5.436 


1.00 


33.03 


639 


N 


SER 


247 


3.566 


76.835 


-4.774 


1.00 


33.90 


640 


CA 


SER 


247 


3.215 


76.663 


-3.378 


1.00 


34.79 


641 


CB 


SER 


247 


3.991 


77.651 


-2.525 


1.00 


37.25 


642 


OG 


SER 


247 


5.349 


77.302 


-2.533 


1.00 


41.24 


643 


C 


SER 


247 


1.737 


76.897 


-3.171 


1.00 


32.81 


644 


O


SER 


247 


1.084 


76.218 


-2.380 


1.00 


32.97 


645 


N 


THR 


248 


1.214 


77.881 


-3.882 


1.00 


32.23 


646 


CA 


THR 


248 


-0.190 


78.218 


-3.772 


1.00 


31.48 


647 


CB 


THR 


248 


-0.494 


79.492 


-4.549 


1.00 


32.12 


648 


OG1 


THR 


248 


0.321 


80.553 


-4.035 


1.00 


33.82 


649 


CG2 


THR 


248 


-1.961 


79.864 


-4.410 


1.00 


31.31


650 


C 


THR 


248 


-1.026 


77.075 


-4.310 


1.00 


30.39 


651 


O 


THR 


248 


-2.012 


76.677 


-3.693 


1.00 


29.27 


652 


N 


TYR 


249 


-0.622 


76.546 


-5.462 


1.00 


30.38 


653 


CA 


TYR 


249 


-1.329 


75.434 


-6.076 


1.00 


30.33
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654 


CB 


TYR 


249 


-0.649 


75.034 


-7.386 


1.00 


29.81 


655 


CG 


TYR 


249 


-1.224 


73.784 


-8.008 


1.00 


30.36 


656


CD1 


TYR 


249 


-2.531 


73.766 


-8.498 


1.00 


28.62 


657 


CE1 


TYR 


249 


-3.086 


72.601 


-8.996 


1.00 


31.63 


658 


CD2 


TYR 


249 


-0.482 


72.594 


-8.043 


1.00 


28.99 


659 


CE2 


TYR 


249 


-1.018 


71.424 


-8.533 


1.00 


29.89 


660 


CZ 


TYR


249 


-2.328 


71.426 


-9.008 


1.00 


34.44 


661 


OH 


TYR 


249 


-2.891 


70.247 


-9.459 


1.00 


36.73 


662 


C 


TYR 


249 


-1.324 


74.270 


-5.086 


1.00 


31.21 


663 


O 


TYR 


249 


-2.360 


73.642 


-4.836 


1.00 


30.19 


664 


N 


MET 


250 


-0.155 


73.996 


-4.510 


1.00 


31.98 


665 


CA 


MET 


250 


-0.028 


72.921 


-3.527 


1.00 


33.11 


666 


CB 


MET 


250 


1.432 


72.750 


-3.110 


1.00 


32.37 


667 


CG 


MET 


250 


2.291 


72.132 


-4.186 


1.00 


31.75 


668 


SD 


MET 


250 


1.684 


70.502 


-4.687 


1.00 


34.03 


669 


CE 


MET 


250 


2.509 


69.452 


-3.449 


1.00 


31.25 


670 


C 


MET 


250 


-0.885 


73.188 


-2.291 


1.00 


33.11 


671 


O


MET 


250 


-1.629


72.323 


-1.836 


1.00 


32.66 


672 


N 


PHE 


251 


-0.775 


74.390 


-1.745 


1.00 


33.37 


673 


CA 


PHE 


251 


-1.561 


74.744 


-0.576 


1.00 


33.59 


674 


CB 


PHE 


251 


-1.286


76.195 


-0.188 


1.00 


34.12 


675 


CG 


PHE 


251 


0.077 


76.411 


0.392 


1.00 


34.59 


676 


CD1 


PHE 


251 


0.700 


77.647 


0.287 


1.00 


34.63 


677 


CD2 


PHE 


251 


0.724 


75.385 


1.076 


1.00 


35.08 


678 


CE1 


PHE 


251 


1.949 


77.863 


0.855 


1.00 


36.50 


679 


CE2 


PHE 


251 


1.972 


75.587 


1.648 


1.00 


35.30 


680 


CZ 


PHE 


251 


2.585 


76.834 


1.537 


1.00 


36.25 


681 


C 


PHE 


251 


-3.044 


74.540 


-0.864 


1.00 


33.35 


682 


O


PHE 


251 


-3.742 


73.869 


-0.102 


1.00 


32.53 


683 


N 


LYS 


252 


-3.522 


75.117 


-1.964 


1.00 


32.74 


684 


CA 


LYS 


252 


-4.921 


74.972 


-2.337 


1.00 


31.51 


685 


CB 


LYS 


252 


-5.136 


75.467 


-3.766 


1.00 


31.30 


686 


CG 


LYS 


252 


-5.214 


76.978 


-3.879 


1.00 


33.17 


687 


CD 


LYS 


252 


-5.138 


77.454 


-5.324 


1.00 


36.80 


688 


CE 


LYS 


252 


-5.911 


78.769 


-5.533 


1.00 


40.52 


689 


NZ 


LYS 


252 


-5.647 


79.815 


-4.486 


1.00 


42.52 


690 


C 


LYS 


252 


-5.318 


73.503 


-2.214 


1.00 


32.28 


691 


O 


LYS 


252 


-6.335 


73.174 


-1.606 


1.00 


32.02 


692 


N 


LYS 


253 


-4.493 


72.622 


-2.774 


1.00 


32.25 


693 


CA 


LYS 


253 


-4.750 


71.190 


-2.725 


1.00 


31.71 


694 


C 


LYS 


253 


-4.820 


70.699 


-1.281


1.00 


33.18 


695 


O


LYS 


253 


-5.708 


69.911 


-0.926 


1.00 


34.96 


696 


N 


ILE 


254 


-3.880 


71.164 


-0.458 


1.00 


32.64 


697 


CA 


ILE 


254 


-3.807 


70.797 


0.960 


1.00 


31.44 


698 


CB 


ILE 


254 


-2.495 


71.361 


1.609 


1.00 


31.24 


699 


CG2 


ILE 


254 


-2.757 


71.843 


3.025 


1.00 


30.88 


700 


CG1 


ILE 


254 


-1.425


70.273 


1.667 


1.00 


31.57 


701 


CD1 


ILE 


254 


-1.108 


69.641 


0.345 


1.00 


35.02 


702 


C 


ILE 


254 


-5.030 


71.272 


1.772 


1.00 


30.82 


703 


O 


ILE 


254 


-5.500 


70.568 


2.671 


1.00 


30.56 


704 


N 


ILE 


255 


-5.533 


72.465 


1.471 


1.00 


29.36 


705 


CA 


ILE 


255 


-6.696 


72.982 


2.190 


1.00 


29.54 


706 


CB 


ILE 


255 


-6.970 


74.485 


1.850 


1.00 


29.42 


707 


CG2 


ILE 


255 


-8.291 


74.939 


2.449 


1.00 


24.21 


708 


CG1 


ILE 


255 


-5.829 


75.357 


2.383 


1.00 


29.56 
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709 


CD1 


ILE 


255 


-5.954 


76.826 


1.993 


1.00 


29.49 


710 


C 


ILE 


255 


-7.902 


72.140 


1.798 


1.00 


28.95 


711 


O 


ILE 


255 


-8.761 


71.841 


2.626 


1.00 


28.74 


712 


N 


SER 


256 


-7.952 


71.752 


0.529 


1.00 


28.49 


713 


CA 


SER 


256 


-9.046 


70.935 


0.040 


1.00 


29.78 


714 


CB 


SER 


256 


-8.946 


70.765 


-1.473


1.00 


29.69 


715 


OG 


SER 


256 


-8.929 


72.020 


-2.116 


1.00 


31.07 


716 


C 


SER 


256 


-8.992 


69.569 


0.714 


1.00 


29.53 


717 


O


SER 


256 


-10.015 


68.930 


0.924 


1.00 


29.42 


718 


N 


PHE 


257 


-7.790 


69.119 


1.042 


1.00 


29.28 


719 


CA 


PHE 


257 


-7.628 


67.832 


1.699 


1.00 


30.50 


720 


CB 


PHE 


257 


-6.145 


67.504 


1.855 


1.00 


29.19 


721 


CG 


PHE 


257 


-5.887 


66.239 


2.611 


1.00 


28.46 


722 


CD1 


PHE 


257 


-6.234 


65.001 


2.067 


1.00 


28.14 


723 


CD2 


PHE 


257 


-5.309 


66.281 


3.876 


1.00 


28.50 


724 


CE1 


PHE 


257 


-6.008 


63.814 


2.776 


1.00 


27.99 


725 


CE2 


PHE 


257 


-5.078 


65.102 


4.597 


1.00 


28.73 


726 


CZ


PHE 


257 


-5.430 


63.866 


4.042 


1.00 


29.76 


727 


C 


PHE 


257 


-8.269 


67.901 


3.075 


1.00 


31.48 


728 


O


PHE 


257 


-9.085 


67.054 


3.450 


1.00 


29.83 


729 


N 


ALA 


258 


-7.880 


68.924 


3.824 


1.00 


33.28 


730 


CA 


ALA 


258 


-8.387 


69.137 


5.167 


1.00 


34.73 


731 


CB 


ALA 


258 


-7.792 


70.418 


5.743 


1.00 


33.79 


732 


C 


ALA 


258 


-9.910 


69.208 


5.193 


1.00 


36.37 


733 


O


ALA 


258 


-10.565 


68.399 


5.843


1.00 


36.08 


734 


N 


LYS 


259 


-10.464 


70.175 


4.470 


1.00 


38.81 


735 


CA 


LYS 


259 


-11.908 


70.393 


4.420 


1.00 


40.37 


736 


CB 


LYS 


259 


-12.208 


71.614 


3.553 


1.00 


39.71 


737 


CG 


LYS 


259 


-11.669 


72.926 


4.093 


1.00 


41.17 


738 


CD 


LYS 


259 


-11.994 


74.064


3.135 


1.00 


43.90 


739 


CE 


LYS 


259 


-11.760 


75.433 


3.769 


1.00 


48.08 


740 


NZ 


LYS 


259 


-12.688 


75.721 


4.916


1.00 


49.91 


741 


C 
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1982 


CD1 


LEU 


412 


7.278 


85.697 
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9.524 
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1983 


CD2 


LEU 


412 


6.779 
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413 
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413 
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1995 
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413 
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87.995 
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1996 
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89.186 


4.671 
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1997 
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13.031 


87.045 


4.037 
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1 .UU 
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1998 


CA 
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414 


13.015 


87.346 
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1999 


CB 
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414 


12.854 


86.046 


1.781


1.00
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40.DH 


2000 


CG2 


ILE 


414 


12.563 


86.378 


A O A O 
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a nn 
1.00


A7 R1 


2001 


CG1 


ILE 


414 


14.119 


85.191 


a on e 

1 .92b 


1.00


AR Rft 


2002 


CD1 


ILE 


414 


14.116 


83.906 


1.107 


1.00


ACL A~7 
40.41 


2003 


C 


ILE 


414 


11.873 


88.303 


2.2f 9 


1.00


AR n7 


2004 
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ILE 


414 


12.054 


89.253 


1.516 


1.00


A7 ACS 
4 / .4U 


2005 


N 


GLN 


415 


10.702 


88.051 


2.859 


1.00


/fR RQ 


2006 


CA 


GLN 


415 


9.520 


88.883 


2.b2U 


1.00


A7 ^7 


2007 


CB 


GLN 


415 


8.301 


88.289 


3.326 


1.00


AC QQ 
'fD.OO 


2008 


CG 


GLN 


415 


7.003 


89.031 


3.081 


1.00


A9 71 


2009 


CD 


GLN 


415 


6.495 


88.845 


1 .b/o 


1.00


AA n9 


2010 


OE1 


GLN 


415 


7.043 


88.052 


a nnc 

o.yub 


1.00


ar nR 


2011 


NE2 


GLN 


415 


5.434 


89.567 


A 794 

1 .o21 


1.00


A9 91 


2012 


C 


GLN 


415 


9.739 


90.300 


3.127 


1.00


AQ RQ 


2013 


O 


GLN 


415 


9.307 


ah o"7rv 
91 .2/U 
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1 nn 


49.56 


2014 


N 


ASP 


416 
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10.411 
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51.59 


2015 


CA 


ASP 


416 


10.682 
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91 .70*3 
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4. ODD 
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53.89 


2016 


CB 


ASP 


416 


A A O C A 

1 1 .obi 


QA CA 7 
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c 99Q 
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56.53 


2017 


CG 


ASP 


416 


11.762


no ooq 
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59.62 


2018 
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ASP 
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416 


lU.oob 
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1 .00 


60.06 


2019 


OD2 


ASP 


416


A o qc;7 


Q9 QQ9 


7 91 R 


1.00 


61.63 
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c 


ASP 


416 


11.527 


92.610 


3.968 


1.00 


53.82 


2021 


O 


ASP 


416 


11.501 


93.829 


4.126 


1.00 


54.73 


2022 


N 


ILE 


417


12.271 


92.037 


3.026 


1.00 


53.12 


2023 


CA 


ILE 


417 


13.086 


92.866 


2.142 


1.00 


53.22 


2024 


CB 


ILE 


417 


14.610 


92.597 


2.322 


1.00 


53.89 


2025 


CG2 


ILE 


417 


15.080 


93.156 


3.656 


1.00 


54.20 


2026 


CG1 


ILE 


417 


14.916 


91.101 


2.224 


1.00 


54.07 


2027 


CD1 


ILE 


417 


14.828 


90.535 


0.828 


1.00 


54.43 


2028 


C 


ILE 


417 


12.729 


92.706 


0.673 


1.00 


53.35 
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2029 


O


ILE 


417 


13.124 


93.522 


-0.162 


1.00 


53.35


2030 


N 


HIS 


418 


11.973 


91.661 


0.357 


1.00 


52.73


2031 


CA 


HIS 


418 


11.583 


91.421 


-1.025


1.00 


52.17


2032 


CB 


HIS 


418 


12.631 


90.533 


-1.704 


1.00 


52.92 


2033 


CG 


HIS 


418 


12.588 


90.573 


-3.201 


1.00 


54.38 


2034 


CD2 


HIS 


418 


13.399 


91.184 


-4.097 


1.00 


53.68 


2035 


ND1 


HIS 


418 


11.624 


89.924 


-3.941 


1.00 


55.24 


2036 


CE1 


HIS 


418 


11.843 


90.131 


-5.227 


1.00 


53.97 


2037 


NE2 


HIS 


418 


12.914 


90.892 


-5.349 


1.00 


54.12 


2038 


C 


HIS 


418 


10.217 


90.754 


-1.070


1.00 


51.33 


2039 


O


HIS 


418 


10.123 


89.532 


-1.136 


1.00 


53.28 


2040 


N 


PRO 


419 


9.136 


91.552 


-1.028


1.00 


49.45 


2041 


CD 


PRO 


419 


9.090 


93.015 


-1.173 


1.00 


48.47 


2042 


CA 


PRO 


419 


7.785 


90.985 


-1.067 


1.00 


47.63 


2043 


CB 


PRO 


419 


6.895 


92.225 


-1.134 


1.00 


47.50 


2044 


CG 


PRO 


419 


7.753 


93.218 


-1.843 


1.00 


47.69 


2045 


C


PRO 


419 


7.625 


90.084 


-2.284 


1.00 


46.08 


2046 


O


PRO 


419 


8.133 


90.401 


-3.355 


1.00 


46.98 


2047 


N 


PHE 


420 


6.932 


88.961 


-2.120 


1.00 


44.39 


2048 


CA 


PHE 


420 


6.730 


88.021 


-3.227 


1.00 


43.32 


2049 


CB 


PHE 


420 


8.023 


87.253 


-3.508 


1.00 


42.27 


2050 


CG 


PHE 


420 


8.301 


86.182 


-2.502 


1.00 


41.66 


2051 


CD1 


PHE 


420 


7.779 


84.906 


-2.665 


1.00 


41.66 


2052 


CD2 


PHE 


420 


9.003 


86.473 


-1.340 


1.00 


41.36 


2053 


CE1 


PHE 


420 


7.946 


83.942 


-1.679


1.00 


41.75


2054 


CE2 


PHE 


420 


9.172 


85.513 


-0.351 


1.00 


40.46 


2055 


CZ 


PHE 


420 


8.641 


84.251 


-0.522 


1.00 


40.32 


2056 


C 


PHE 


420 


5.634 


87.020 


-2.865 


1.00 


41.58 


2057 


O


PHE 


420 


5.030 


86.397 


-3.735 


1.00 


40.37 


2058 


N 


ALA 


421 


5.402 


86.855 


-1.569 


1.00 


39.94 


2059 


CA 


ALA 


421 


4.383 


85.940 


-1.091 


1.00 


39.63 


2060 


CB 


ALA 


421 


4.312 


86.002 


0.431 


1.00 


38.95 


2061 


C 


ALA 


421 


3.015 


86.268 


-1.700 


1.00 


39.52 


2062 


O


ALA 


421 


2.658 


87.439 


-1.872 


1.00 


39.85 


2063 


N 


THR 


422 


2.256 


85.226 


-2.028 


1.00 


37.98 


2064 


CA 


THR 


422 


0.925 


85.391 


-2.610 


1.00 


35.41 


2065 


CB 


THR 


422 


0.444 


84.108 


-3.328 


1.00 


35.08 


2066 


OG1


THR 


422 


0.384 


83.027 


-2.385 


1.00 


32.71 


2067 


CG2 


THR 


422 


1.381 


83.748 


-4.479 


1.00 


35.60 


2068 


C 


THR 


422 


-0.093 


85.672 


-1.519 


1.00 


34.77 


2069 


O


THR 


422 


0.198 


85.516 


-0.332 


1.00


OO "70 

OO. ( O 


2070 


N 


PRO 


423 


-1.304 


86.104 


-1.912 


1.00


O/l OO. 

54. i.o 


2071 


CD 


PRO 


423 


-1.679 


86.572 


-3.263 


1.00


o4.14 


2072 


CA 


PRO 


423 


-2.370 


86.394 


-0.955 


1.00


0\ .OO 


2073 


CB 


PRO 


423 


-3.567 


86.646 


-1.856


1.00


O 1 . 1 D 


2074 


CG 


PRO 


423 


-2.946 


87.379 


-2.995


1.00


H 9 A 
OO.Z4 


2075


C


PRO


423


-2.581


85.200 


-0.031 


1.00 


31.37 


2076 


O


PRO 


423 


-2.597 


85.350 


1.195 


1.00 


31.72 


2077 


N 


LEU 


424 


-2.729 


84.015 


-0.627 


1.00 


30.26 


2078 


CA 


LEU


424 


-2.926 


82.789 


0.139 


1.00 


28.31 


2079 


CB 


LEU 


424 


-3.014 


81.580 


-0.781 


1.00 


25.91 


2080 


CG 


LEU 


424 


-3.373 


80.305 


-0.015 


1.00 


29.72 


2081 


CD1 


LEU 


424 


-4.792 


80.435 


0.517 


1.00 


29.37 


2082 


CD2 


LEU 


424 


-3.257 


79.080 


-0.919 


1.00 


31.09 


2083 


C


LEU 


424 


-1.764


82.599


1.100 


1.00 


29.68 
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2084 


O


LEU 


424 


-1.967


82.373 


2.292 


1.00




2085 


N 


MET 


425 


-0.543 


82.694 


0.584 


1.00




2086 


CA 


MET 


425 


0.643 


82.551 


1.420 


1.00


on on 

zy.yu 


2087 


CB 


MET 


425 


1.901


82.755 


0.583 


1.00


^y.oy 


2088 


CG 


MET 


425 


2.213 


81.577 


-0.312 


1.00


oa on 


2089 


SD 


MET 


425 


3.537 


81.924 


-1.456 


1.00


32.42 


2090 


CE 


MET 


425 


4.944 


82.048 


-0.372 


1.00


on no 

29. Oo 


2091 


C 


MET 


425 


0.631 


83.536 


2.585 


1.00


30.67


2092 


O


MET 


425 


1.054 


83.210 


3.694 


1.00 


30.26 


2093 


N 


GLN 


426 


0.151 


84.745 


2.324 


1.00 


31.96 


2094 


CA 


GLN 


426 


0.067 


85.777 


3.349 


1.00 


33.65 


2095 


CB 


GLN 


426 


-0.409 


87.102 


2.719 


1.00 


32.99


2096 


CG 


GLN 


426 


0.650 


87.813 


1.870 


1.00 


31.49 


2097 


CD 


GLN 


426 


0.080 


88.924 


0.998 


1.00 


32.35


2098 


OE1 


GLN 


426 


-0.741 


89.721 


1.445 


1.00 


34.92 


2099 


NE2 


GLN 


426 


0.527 


88.987 


-0.250 


1.00 


30.45 


2100 


C


GLN 


426 


-0.905 


85.313 


4.444 


1.00 


35.37 


2101 


O


GLN 


426 


-0.632 


85.464 


5.637 


1.00 


34.67 


2102 


N 


GLU 


427 


-2.035 


84.745 


4.031 


1.00 


37.68 


2103 


CA 


GLU 


427 


-3.040 


84.255 


4.969 


1.00 


40.84 


2104 


CB 


GLU 


427 


-4.156 


83.531 


4.214 


1.00 


43.08 


2105 


CG 


GLU 


427 


-5.136 


84.421 


3.482 


1.00 


47.76 


2106 


CD 


GLU 


427 


-6.240 


84.936 


4.385 


1.00 


51.36 


2107 


OE1 


GLU 


427 


-7.187 


85.586 


3.873 


1.00 


51.80 


2108 


OE2 


GLU 


427 


-6.160 


84.686 


5.609 


1.00 


54.10 


2109 


C 


GLU 


427 


-2.415 


83.277 


5.955 


1.00 


41.89 


2110 


O 


GLU 


427 


-2.392


83.515 


7.163 


1.00 


41.95


2111 


N 


LEU 


428 


-1.915 


82.174 


5.405 


1.00 


43.21 


2112 


CA 


LEU 


428 


-1.293 


81.091 


6.160 


1.00 


44.72 


2113 


CB 


LEU 


428 


-0.738 


80.039 


5.197 


1.00 


43.74 


2114 


CG 


LEU 


428 


-1.751 


79.231 


4.385 


1.00 


44.22 


2115 


CD1 


LEU 


428 


-1.074


78.630 


3.159 


1.00 


44.72 


2116 


CD2 


LEU 


428 


-2.359 


78.153 


5.267 


1.00 


42.69 


2117 


C 


LEU 


428 


-0.185 


81.541 


7.091 


1.00 


46.27 


2118 


O 


LEU 


428 


-0.101 


81.090 


8.231 


1.00 


47.11


2119 


N 


PHE 


429 


0.676 


82.420 


6.603 


1.00 


49.25 


2120 


CA 


PHE 


429 


1.776 


82.900 


7.418 


1.00 


52.66 


2121 


CB 


PHE 


429 


3.061 


82.929 


6.578 


1.00 


CO AC 

53.4o 


2122 


CG 


PHE 


429 


3.518 


81.555 


6.134 


1.00


C c o o 
OD.OO 


2123 


CD1 


PHE 


429 


2.769 


80.814 


5.219


1.00


C7 on 

of .oy 


2124 


CD2 


PHE 


429 


4.666 


80.976 


6.675 


1.00


0 f . 1 u 


2125 


CE1 


PHE 


429 


3.151 


79.512 


4.851 


1.00


co OA 


2126 


CE2 


PHE 


429 


5.059 


79.673 


6.314 


1.00


DO.\ i 


2127 


CZ 


PHE 


429 


4.298 


78.942 


5.402 


1.00


OD.UO 


2128 


C 


PHE 


429 


1.458 


84.264 


o no a 
O.U34 


1.00


On.OH 


2129 
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PHE 


429 


0.288 
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84.999 


8.416 


1.00 


56.82 


2131 


CA 


GLY 
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2.259 


86.298 


9.018 


1.00 


60.20 


2132 


C 


GLY 


430


1.286 


87.162 


8.233 


1.00 


61.66 


2133 


O


GLY 


430 


0.063 


87.075 


8.415 


1.00 


61.39 


2134 


N 


ILE 


431 


1.851 


87.989 


7.355


1.00 


61.49 


2135 


CA 


ILE


431 


1.115 


88.927 


6.513 


1.00 


61.54 


2136 


CB 


ILE 


431 


1.624 


88.874 


5.068 


1.00 


61.84 


2137 


CG2 


ILE 


431 


1.070 


90.056 


4.289 


1.00 


62.88 


2138 


CG1 


ILE 


431 


3.154


88.926


5.044 


1.00 


62.50 
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2139 


CD1 


ILE 


431 


3.833 


87.651 


5.509 


1.00 


62.73 


2140 


C 


ILE 


431 


-0.405 


88.762 


6.496 


1.00 


61.67 


2141 


O


ILE 


431 


-1.145 


89.729 


6.278 


1.00 


60.52 


2142 


O


HOH 


1426 


-1.082 


69.659 


-2.625 


1.00 


24.14 


2143 


O


HOH 


1427 


-10.671 


76.436 


6.561 


1.00 


39.57 


2144 


O


HOH 


1428 


-2.237 


71.694 


12.302 


1.00 


19.63 


2145 


O


HOH 


1430 


8.952


56.719 


1.568 


1.00 


18.24 


2146 


O


HOH 


1431 


-0.779 


59.892 


24.059 


1.00 


26.54 


2147 


O


HOH 


1432 


22.789 


67.648 


-0.238 


1.00 


45.09 


2148 


O


HOH 


1433 


-11.256 


70.329 


23.204 


1.00 


39.97 


2149 


O


HOH 


1434 


-10.044 


50.821 


16.739 


1.00 


41.82 


2150 


O


HOH 


1436 


2.167 


79.769 


-17.224 


1.00 


30.96 


2151 


O


HOH 


1437 


16.991 


84.149 


-14.225 


1.00 


48.61 


2152 


O


HOH 


1438 


-11.704 


56.433 


-1.728 


1.00 


49.66 


2153 


O


HOH 


1439 


16.619 


50.272 


11.443 


1.00 


32.78 


2154 


O


HOH 


1440 


-0.731 


59.018 


-0.091 


1.00 


19.99 


2155 


O


HOH 


1441 


-7.575 


88.256 


3.592 


1.00 


32.60 


2156 


O


HOH 


1442 


-4.103 


71.133 


-5.988 


1.00 


43.57 


2157 


O


HOH 


1443 


3.986 


90.037 


-1.136 


1.00 


47.02 


2158 


O


HOH 


1444 


6.284 


48.618 


0.356 


1.00 


28.16 


2159 


O


HOH 


1445 


7.455 


55.120 


-1.050 


1.00 


40.82 


2160 


O


HOH 


1446 


9.754 


82.836 


-10.782 


1.00 


17.83 


2161 


O


HOH 


1447 


-5.744 


62.064 


-7.344 


1.00 


26.53 


2162 


O


HOH 


1449 


8.757 


42.647 


12.829 


1.00 


33.80 


2163 


O


HOH 


1450 


-6.308 


70.707 


-8.559 


1.00 


43.71 


2164 


O


HOH 


1452 


13.218 


57.431 


-5.501 


1.00 


44.44 


2165 


O


HOH 


1453 


0.854


52.302 


-1.519 


1.00 
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78.428 


0.368 


0.40 


55.77 


2349 


C26 


MON 


4 


15.313 


79.458 


0.480 


0.40 


56.44 


2350 


O27


MON 


4 


9.716 


78.033 


2.935 


0.40 


56.39 


2351 


O28


MON 


4 


10.783 


79.845 


1.379 


0.40 


55.48 


2352 


C29 


MON 


4 


10.133 


80.538 


0.242 


0.40 


56.37 


2353 


C30 


MON 


4 


10.735 


81.909 


0.029 


0.40 


57.08 


2354 


O31


MON 


4 


8.770 


78.294 


0.725 


0.40 


55.95 


2355 


C32 


MON 


4 


8.067 


77.052 


0.863 


0.40 


55.63 


2356 


C33 


MON 


4 


6.756 


77.095 


0.032 


0.40 


56.56


2357 


C1 


MON 


5 


12.648 


74.791 


0.798 


0.20 


47.05 


2358 


C2 


MON 


5


13.126


76.119


0.932 


0.20 


47.32 
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2359 


C3 


MON 


5 


12.656 


77.303 


0.159 


0.20 


47.22 


2360 


C4 


MON 


5 


11.616 


76.953 


-0.780 


0.20 


47.19 


2361 


C5 


MON 


5 


11.126 


75.646 


-0.931 


0.20 


47.61 


2362 


C6 


MON 


5 


11.639 


74.637 


-0.165 


0.20 


47.35 


2363 


C7 


MON 


5 


10.111 


75.325 


-1.848 


0.20 


47.58 


2364 


C8 


MON 


5 


8.874 


74.929 


-1.501 


0.20 


48.33 


2365 


P9 


MON 


5 


7.588 


75.119 


-2.808 


0.20 


47.86 


2366 


P10 


MON 


5 


8.265 


74.266 


0.100 


0.20 


50.66 


2367 


O11


MON 


5 


14.069 


76.314 


1.815 


0.20 


47.52 


2368 


C12 


MON 


5 


13.108 


73.595 


1.565 


0.20 


46.60 


2369 


C13 


MON 


5 


13.080 


78.852 


0.183 


0.20 


47.46 


2370 


C14 


MON 


5 


12.817 


73.796 


3.086 


0.20 


45.89 


2371 


C15 


MON 


5 


14.636 


73.345 


1.297 


0.20 


46.19 


2372 


C16 


MON 


5 


12.362 


72.280 


1.154 


0.20 


46.10 


2373 


C17 


MON 


5 


11.819 


79.734 


0.589 


0.20 


47.01 


2374 


C18 


MON 


5 


14.218 


79.266 


1.196 


0.20 


47.12 


2375 


C19


MON 


5 


13.555 


79.299 


-1.261 


0.20 


47.39 


2376 


O20 


MON 


5 


6.243 


75.028 


-2.117 


0.20 


47.51 


2377 


O21


MON 


5 


7.612 


73.832 


-3.795 


0.20 


47.61 


2378 


C22 


MON 


5 


8.896 


73.594 


-4.380 


0.20 


47.53 


2379 


C23 


MON 


5 


8.868 


72.365 


-5.290 


0.20 


47.07 


2380 


O24


MON 


5 


7.601 


76.547 


-3.505 


0.20 


48.11 


2381 


C25 


MON 


5 


8.083 


77.621 


-2.598 


0.20 


48.54 


2382 


C26 


MON 


5 


8.106 


78.979 


-3.250 


0.20 


48.93 


2383 


O27


MON 


5 


9.172 


74.062 


1.195 


0.20 


50.80 


2384 


O28


MON 


5 


6.985 


75.197 


0.526 


0.20 


50.30 


2385 


C29 


MON 


5 


7.371


76.386 


1.227 


0.20 


50.25 


2386 


C30 


MON 


5 


6.133 


77.218 


1.594 


0.20 


49.96 


2387 


O31


MON 


5 


7.569 


72.853 


-0.242 


0.20 


49.96 


2388 


C32 


MON 


5 


8.536 


71.922 


-0.647 


0.20 


49.66 


2389 


C33 


MON 


5 


7.882 


70.608 


-0.958 


0.20 


49.90 


2390 


O


HOH 


1595 


7.323 


78.531 


4.526 


1.00 


69.10 


2391 


O


HOH 


1596 


17.213 


76.326 


0.429 


1.00 


57.74 


2392 


O


HOH 


1597 


-12.947 


67.694 


7.441 


1.00 


77.03 


2393 


O


HOH 


1598 


16.163 


67.413 


-22.431 


1.00 


40.69 


2394 


O


HOH 


1599 


18.369 


92.512 


7.265 


1.00 


52.48 


2395 


O


HOH 


1600 


29.442 


64.461 


-1.726 


1.00 


66.79 


2396 


O


HOH 


1601 


19.427 


85.921 


-22.662 


1.00 


60.16 


2397 


O


HOH 


1602 


5.344 


90.815 


7.154 


1.00 


54.96 


2398 


O


HOH 


1603 


-14.216 


50.571 


5.561 


1.00 


54.96 


2399 


O


HOH 


1604 


5.533 


45.964 


0.404 


1.00 


62.55 


2400 


O


HOH 


1605 


-1.394


63.145 


20.705 


1.00 


40.08 


2401 


O


HOH 


1606 


-2.578 


54.566 


22.874 


1.00 


57.40 


2402 


O


HOH 


1607 


3.600 


69.196 


22.807 


1.00 


54.51 


2403 


O


HOH 


1608 


6.139 


65.007 


-18.611 


1.00 


54.86 


2404 


O


HOH 


1609 


4.202 


75.224 


-27.568 


1.00 


58.04 


2405 


O


HOH 


1610 


-5.421 


61.703


24.061 


1.00 


57.88 


2406 


O


HOH 


1611 


-11.943


45.372 


11.041 


1.00 


62.72 
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TABLE 5 



ATOMIC COORDINATE DATA FOR HUMAN VITAMIN D RECEPTOR 
EMPLOYED IN THE MOLECULAR REPLACEMENT SOLUTION OF HUMAN 
PXR LIGAND BINDING DOMAIN CRYSTALS 



ATOM 


ATOM 
TYPE 


RESIDUE 


# 


X 


Y 


Z


OCC 


B 


ATOM 


1 


N 


LEU 


A 


120 


34.417 


18.787 


67.312 


1.00 


50.31 


2 


CA 


LEU 


A 


120 


34.298 


17.304 


67.212 


1.00 


49.96 


3 


C


LEU 


A 


120 


33.672 


16.891 


65.886 


1.00 


49.44 


4 


O


LEU 


A 


120 


32.815 


17.592 


65.344 


1.00 


49.49 


5 


CB 


LEU 


A 


120 


33.447 


16.756 


68.363 


1.00 


50.64 


9 


N 


ARG 


A 


121 


34.106 


15.745 


65.375 


1.00 


48.14 


10 


CA 


ARG 


A 


121 


33.599 


15.221 


64.117 


1.00 


47.01 


11 


C


ARG 


A 


121 


33.113 


13.790 


64.314 


1.00 


45.50 


12 


O


ARG 


A 


121 


33.775 


12.836 


63.905


1.00 


45.36 


13 


CB 


ARG 


A 


121 


34.700 


15.264 


63.052 


1.00 


48.45 


14 


CG 


ARG 


A 


121 


35.233 


16.664 


62.790 


1.00 


49.89 


15 


CD 


ARG 


A 


121 


36.430 


16.655 


61.852 


1.00 


52.32


16 


NE 


ARG 


A 


121 


36.100 


16.133 


60.529 


1.00 


53.49 


17 


CZ 


ARG 


A 


121 


36.947 


16.112 


59.504 


1.00 


54.08 


18 


NH1 


ARG 


A 


121 


38.178 


16.586 


59.648 


1.00 


54.50 


19 


NH2 


ARG 


A 


121 


36.563 


15.620 


58.334 


1.00


54.12 


20 


N 


PRO 


A 


122 


31.946 


13.622 


64.955 


1.00 


43.87 


21 


CA 


PRO 


A 


122 


31.403 


12.282 


65.187 


1.00 


42.99 


22 


C 


PRO 


A 


122 


31.173 


11.529


63.881 


1.00 


42.25 


23 


O 


PRO 


A 


122 


30.823 


12.125 


62.862 


1.00 


42.01 


24 


CB 


PRO 


A 


122 


30.105 


12.561 


65.944 


1.00 


42.59 


27 


N 


LYS 


A 


123 


31.379 


10.218 


63.920 


1.00 


41.53 


28 


CA 


LYS 


A 


123 


31.205 


9.378 


62.744 


1.00 


41.30 


29 


C 


LYS 


A 


123 


29.732 


9.158 


62.431 


1.00 


40.35 


30 


O 


LYS 


A 


123 


28.877 


9.250 


63.313 


1.00 


39.21 


31 


CB 


LYS 


A 


123


31.885 


8.024 


62.965 


1.00 


42.56 


36 


N 


LEU 


A 


124 


29.439 


8.879 


61.165 


1.00 


39.48 


37 


CA 


LEU 


A 


124 


28.071 


8.622 


60.744 


1.00 


38.64 


38 


C 


LEU 


A 


124 


27.606 


7.325 


61.384 


1.00 


38.41 


39 


O 


LEU 


A 


124 


28.293 


6.308 


61.304


1.00 


39.12 


40 


CB 


LEU 


A 


124 


27.996 


8.491 


59.220 


1.00 


37.76 


41 


CG 


LEU 


A 


124 


28.162 


9.776 


58.406 


1.00 


37.83 


42 


CD1 


LEU 


A 


124 


28.401 


9.438 


56.941


1.00


37.98 


43 


CD2 


LEU 


A 


124 


26.922 


10.633 


58.564 


1.00 


36.97 


44 


N 


SER 


A 


125 


26.448 


7.362 


62.029 


1.00 


38.47 


45 


CA 


SER 


A 


125 


25.905 


6.168 


62.661 


1.00 


39.40 


46 


C 


SER 


A 


125 


25.496 


5.197 


61.561 


1.00 


40.52 


47 


O 


SER 


A 


125 


25.386 


5.581 


60.395 


1.00 


39.53 


48 


CB 


SER 


A 


125 


24.679 


6.523 


63.495 


1.00 


39.88 


50 


N 


GLU 


A 


126 


25.271 


3.940 


61.923 


1.00 


41.33 


51 


CA 


GLU 


A 


126 


24.865 


2.956 


60.930 


1.00 


42.41 
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52 


C 


GLU 


A 


126 


23.535 


3.385 


60.314 


1.00 


41.49 


53 


O


GLU 


A 


126 


23.313 


3.207 


59.115 


1.00 


41.40 


54 


CB 


GLU 


A 


126 


24.727 


1.573 


61.573 


1.00 


45.02 


55 


CG 


GLU 


A 


126 


24.325 


0.463 


60.605 


1.00 


48.95 


56 


CD 


GLU 


A 


126 


25.202 


0.414 


59.361 


1.00 


51.93 


57 


OE1


GLU 


A 


126 


24.878 


1.105 


58.366 


1.00 


53.34 


58 


OE2


GLU 


A 


126 


26.222 


-0.308 


59.379 


1.00 


53.64 


59 


N 


GLU 


A 


127 


22.659 


3.960 


61.133 


1.00 


40.27 


60 


CA 


GLU 


A 


127 


21.358 


4.412 


60.650 


1.00 


39.52 


61 


C 


GLU 


A 


127 


21.512 


5.550 


59.647 


1.00 


37.38 


62 


O


GLU 


A 


127 


20.814 


5.594 


58.630 


1.00 


36.24 


63 


CB 


GLU 


A 


127 


20.481 


4.891 


61.807 


1.00 


41.53 


64 


CG 


GLU 


A 


127 


19.091 


5.320 


61.363 


1.00 


45.78 


65 


CD 


GLU 


A 


127 


18.236 


5.832 


62.504 


1.00 


47.87 


66 


OE1


GLU 


A 


127 


18.572 


6.890 


63.075 


1.00 


49.93 


67 


OE2


GLU 


A 


127 


17.227 


5.173 


62.832 


1.00 


50.45 


68 


N


GLN 


A 


128 


22.420 


6.473 


59.939 


1.00 


34.92 


69 


CA 


GLN 


A 


128 


22.654 


7.603 


59.052 


1.00 


33.94 


70 


C 


GLN 


A 


128 


23.239 


7.134 


57.721 


1.00 


34.19 


71 


O


GLN 


A 


128 


22.905 


7.671 


56.665 


1.00 


32.45 


72 


CB 


GLN 


A 


128


23.573 


8.622 


59.735 


1.00 


33.20 


73 


CG 


GLN 


A 


128 


22.861 


9.410 


60.835 


1.00 


32.00 


74 


CD 


GLN 


A 


128 


23.785 


10.317 


61.629 


1.00 


32.20 


75 


OE1


GLN 


A 


128 


23.346 


11.326 


62.192 


1.00 


33.66 


76 


NE2 


GLN 


A 


128 


25.061 


9.960 


61.691 


1.00 


30.80 


77 


N 


GLN 


A 


129 


24.101


6.124 


57.768 


1.00 


33.75 


78 


CA 


GLN 


A 


129 


24.692 


5.591 


56.545 


1.00 


35.00 


79 


C 


GLN 


A 


129 


23.588 


4.965 


55.702 


1.00 


34.31 


80 


O


GLN 


A 


129 


23.562 


5.111 


54.479 


1.00 


33.78 


81 


CB 


GLN 


A 


129 


25.747 


4.531 


56.874 


1.00 


37.89 


86 


N 


ARG 


A 


130 


22.674 


4.270 


56.370 


1.00 


33.44 


87 


CA 


ARG 


A 


130 


21.556 


3.614 


55.703 


1.00 


34.05


88 


C 


ARG 


A 


130 


20.653 


4.638 


55.018 


1.00 


32.98 


89 


O


ARG 


A 


130 


20.226 


4.436 


53.881 


1.00 


31.44 


90 


CB 


ARG 


A 


130 


20.759 


2.794 


56.723 


1.00 


37.04 


97 


N 


ILE 


A 


131 


20.367 


5.735 


55.712 


1.00 


31.16 


98 


CA 


ILE 


A 


131 


19.519 


6.790 


55.158 


1.00 


30.41


99 


C 


ILE 


A 


131 


20.120 


7.343 


53.865 


1.00 


29.21 


100 


O


ILE 


A 


131 


19.414 


7.528 


52.872 


1.00 


27.86 


101 


CB 


ILE 


A 


131 


19.334 


7.945 


56.177 


1.00 


31.61


105 


N 


ILE 


A 


132 


21.424 


7.601 


53.876 


1.00 


28.81 


106 


CA 


ILE 


A 


132 


22.094 


8.124 


52.691 


1.00 


29.13 


107 


C 


ILE 


A 


132 


22.029 


7.115 


51.544 


1.00 


29.37 


108 


O


ILE 


A 


132 


21.786 


7.486 


50.394 


1.00 


28.72 


109 


CB 


ILE 


A 


132 


23.570 


8.468 


52.994 


1.00 


29.90 


110 


CG1 


ILE 


A 


132 


23.628 


9.625 


53.995 


1.00 


30.31 


111 


CG2 


ILE 


A 


132 


24.306 


8.838 


51.708


1.00


oU.oZ 


112 


CD1 


ILE 


A 


132 


25.027 


9.997 


54.432 


1.00 


31.33 


113 


N 


ALA 


A 


133 


22.239 


5.841 


51.862 


1.00 


28.31 


114 


CA 


ALA 


A 


133 


22.203 


4.785 


50.851 


1.00 


27.51 


115 


C 


ALA


A 


133 


20.820 


4.680 


50.213 


1.00 


26.94 


116 


O


ALA 


A 


133 


20.694 


4.542 


48.993 


1.00 


26.91 


117 


CB 


ALA 


A 


133 


22.587 


3.454 


51.479 


1.00 


27.94 


118 


N 


ILE 


A 


134 


19.786 


4.739 


51.044 


1.00 


26.00 


119 


CA 


ILE 


A 


134 


18.413 


4.659 


50.564 


1.00 


25.19


120


C 


ILE 


A 


134


18.090


5.832 


49.643 


1.00 


24.84 


121 


O


ILE 


A 


134 


17.490 


5.651 


48.585 


1.00 


23.10 


122 


CB 


ILE 


A 


134 


17.416 


4.660 


51.742 


1.00 


26.47 


126 


N 


LEU 


A 


135 


18.494 


7.030 


50.047 


1.00 


23.54 


127 


CA 


LEU 


A 


135 


18.228 


8.220 


49.242 


1.00 


23.28 


128 


C 


LEU 


A 


135 


18.987 


8.217 


47.914 


1.00


22.05 


129 


O 


LEU 


A 


135 


18.454 


8.656 


46.894 


1.00 


21.44 


130 


CB 


LEU 


A 


135 


18.559 


9.480 


50.045 


1.00 


23.21 


131 


CG 


LEU 


A 


135 


17.644 


9.754 


51.246 


1.00 


24.57 


132 


CD1 


LEU 


A 


135 


18.057 


11.076 


51.900 


1.00 


26.44 


133 


CD2 


LEU 


A 


135 


16.185 


9.820 


50.789 


1.00 


25.56 


134 


N 


LEU 


A 


136 


20.223 


7.725 


47.913 


1.00 


22.40 


135 


CA 


LEU 


A 


136 


20.991 


7.675 


46.669 


1.00 


23.29 


136 


C 


LEU 


A 


136 


20.302 


6.721 


45.705 


1.00 


23.50 


137 


O


LEU 


A 


136 


20.191 


6.996 


44.512 


1.00 


23.31 


138 


CB 


LEU 


A 


136 


22.424 


7.194 


46.920 


1.00 


24.60 


142 


N


ASP 


A 


137 


19.845 


5.591 


46.232 


1.00 


23.87 


143 


CA 


ASP 


A 


137 


19.156 


4.589 


45.427 


1.00 


23.95 


144 


C 


ASP 


A 


137 


17.844 


5.152 


44.870 


1.00 


23.67 


145 


O


ASP 


A 


137 


17.513 


4.943 


43.697 


1.00 


22.79 


146 


CB 


ASP 


A 


137 


18.886 


3.348 


46.282 


1.00 


26.93 


147 


CG 


ASP 


A 


137 


18.158 


2.266 


45.524 


1.00 


31.10 


148 


OD1 


ASP 


A 


137 


17.010 


1.947 


45.900 


1.00 


34.78 


149 


OD2 


ASP 


A 


137 


18.730 


1.734 


44.552 


1.00 


34.13 


150 


N 


ALA 


A 


138 


17.105 


5.867 


45.714 


1.00 


22.31 


151 


CA 


ALA 


A 


138 


15.836 


6.472 


45.312 


1.00 


22.31 


152 


C 


ALA 


A 


138 


16.063 


7.435 


44.157 


1.00 


21.39 


153 


O 


ALA 


A 


138 


15.310 


7.445 


43.183 


1.00 


20.83 


154 


CB 


ALA 


A 


138 


15.213 


7.219 


46.487 


1.00 


23.04 


155 


N 


HIS 


A 


139 


17.107 


8.249 


44.263 


1.00


21.06 


156 


CA 


HIS 


A 


139 


17.408 


9.202 


43.208 


1.00 


21.28 


157 


C 


HIS 


A 


139 


17.814 


8.511 


41.905 


1.00 


21.64 


158 


O 


HIS 


A 


139 


17.385 


8.913 


40.824 


1.00 


21.17 


159 


CB 


HIS 


A 


139 


18.528 


10.152 


43.631 


1.00 


21.21 


165 


N 


HIS 


A 


140


7.800


7.768 


48.959 


1.00 


28.66


596


C 


LYS 


A 


246 


8.914 


6.918 


49.562 


1.00 


29.21 


597 


O


LYS 


A 


246 


8.668 


6.117 


50.466 


1.00 


29.75 


598 


CB 


LYS 


A 


246 


6.997 


6.931 


47.957 


1.00 


30.93 


599 


CG 


LYS 


A 


246 


5.702 


7.593 


47.501 


1.00 


34.75 


600 


CD 


LYS 


A 


246 


5.017 


6.811 


46.383 


1.00 


37.28 


601 


CE 


LYS 


A 


246 


4.410 


5.501 


46.873 


1.00 


40.02 


602 


NZ 


LYS 


A 


246 


3.230 


5.724 


47.756 


1.00 


42.15 


603 


N 


MET 


A 


247 


10.138 


7.104 


49.074 


1.00 


28.68 


604 


CA 


MET 


A 


247 


11.282 


6.339 


49.562 


1.00 


29.45 


605 


C 


MET 


A 


247 


12.076 


7.021 


50.681 


1.00 


28.75 


606 


O 


MET 


A 


247 


13.012 


6.431 


51.230 


1.00 


28.61 


607 


CB 


MET 


A 


247 


12.219 


5.990 


48.396 


1.00 


30.97 


611 


N 


ILE 


A 


248 


11.709 


8.253 


51.023 


1.00 


27.31 


612 


CA 


ILE 


A 


248 


12.391 


8.973 


52.100 


1.00 


28.07 


613 


C 


ILE 


A 


248 


12.033 


8.295 


53.420 


1.00 


28.99 


614 


O 


ILE 


A 


248 


10.859 


8.179 


53.763 


1.00 


28.97 


615 


CB 


ILE 


A 


248 


11.934 


10.446 


52.195 


1.00 


27.49 


616 


CG1


ILE 


A 


248 


12.299 


11.198 


50.916 


1.00 


26.51 


617 


CG2 


ILE 


A 


248 


12.582 


11.114 


53.411 


1.00 


28.63 


618 


CD1 


ILE 


A 


248 


11.730 


12.605 


50.865 


1.00 


25.47 


619 


N 


PRO 


A 


249 


13.041 


7.844 


54.181 


1.00 


30.64 


620 


CA 


PRO 


A 


249 


12.764 


7.182 


55.460 


1.00 


31.91 


621 


C 


PRO 


A 


249 


11.818 


7.992 


56.348 


1.00 


32.16 


622 


O 


PRO 


A 


249 


12.107 


9.138 


56.688 


1.00 


34.02 


623 


CB 


PRO 


A 


249 


14.153 


7.035 


56.075 


1.00 


32.36 


626 


N 


GLY 


A 


250 


10.686 


7.392 


56.706 


1.00 


32.30 


627 


CA 


GLY 


A 


250 


9.725 


8.064 


57.565 


1.00 


33.31 


628 


C 


GLY 


A 


250 


8.542


8.700 


56.858 


1.00 


33.48 


629 


O 


GLY 


A 


250 


7.484 


8.888 


57.459 


1.00 


33.45 


630 


N 


PHE 


A 


251 


8.709 


9.023 


55.579 


1.00 


33.51 


631 


CA 


PHE 


A 


251 


7.643 


9.658 


54.809 


1.00 


33.69 


632 


C 


PHE 


A 


251 


6.335 


8.871 


54.833 


1.00 


34.57 


633 


O 


PHE 


A 


251 


5.259 


9.455 


54.964 


1.00 


35.10 


634 


CB 


PHE 


A 


251 


8.082 


9.850 


53.356 


1.00 


31.35 


635 


CG 


PHE 


A 


251 


7.180 


10.754 


52.564 


1.00 


29.89 


636 


CD1 


PHE 


A 


251 


7.234 


12.134 


52.735 


1.00 


30.12 


637 


CD2 


PHE 


A 


251 


6.276 


10.227 


51.643 


1.00 


30.05 


638 


CE1 


PHE 


A 


251 


6.400 


12.979 


51.999 


1.00 


29.28 


639 


CE2 


PHE 


A 


251 


5.441 


11.063 


50.906 


1.00 


28.78 


640 


CZ 


PHE 


A 


251 


5.505 


12.440 


51.085 


1.00 


28.48 


641 


N 


ARG 


A 


252 


6.431 


7.551 


54.703 


1.00 


36.83 


642 


CA 


ARG 


A 


252 


5.250 


6.691 


54.698 


1.00 


39.19 


643 


C 


ARG 


A 


252 


4.535 


6.647 


56.045 


1.00 


39.61 


644 


O 


ARG 


A 


252 


3.391 


6.200 


56.127 


1.00 


40.31 


645 


CB 


ARG 


A 


252 


5.625 


5.262 


54.292 


1.00 


41.06 


646 


CG 


ARG 


A 


252 


6.138 


5.101 


52.867 


1.00 


44.96 


647 


CD 


ARG 


A 


252 


6.260 


3.620 


52.516 


1.00 


47.63 


648 


NE 


ARG 


A 


252 


6.777 


3.393 


51.169 


1.00 


50.79


649 


CZ 


ARG 


A 


252 


8.062 


3.459 


50.831 


1.00 


51.79 


650 


NH1 


ARG 


A 


252 


8.982 


3.745 


51.745 


1.00 


52.82 


651 


NH2 


ARG 


A 


252 


8.427 


3.235 


49.576 


1.00 


52.64 


652 


N 


ASP 


A 


253 


5.205 


7.102 


57.098 


1.00 


39.96 


653 


CA 


ASP 


A 


253 


4.610 


7.097 


58.430 


1.00 


40.45 


654 


C 


ASP 


A 


253 


3.648 


8.255 


58.635 


1.00 


39.90 


655 


O


ASP


A


253 


2.902 


8.284 


59.612 


1.00 


39.68


656


CB 


ASP 


A 


253 


5.698 


7.127 


59.506


1.00 


42.53 


657 


CG 


ASP 


A 


253 


6.524 


5.856 


59.531 


1.00 


44.84 


658 


OD1


ASP 


A 


253 


5.938 


4.767 


59.345 


1.00 


47.60 


659 


OD2


ASP 


A 


253 


7.752 


5.942 


59.743 


1.00 


45.66 


660 


N 


LEU 


A 


254 


3.669 


9.208 


57.710 


1.00 


38.00 


661 


CA 


LEU 


A 


254 


2.782 


10.361 


57.780 


1.00 


37.81 


662 


C 


LEU 


A 


254 


1.417 


9.978 


57.218 


1.00 


37.76 


663 


O


LEU 


A 


254 


1.293 


9.000 


56.476 


1.00 


37.49 


664 


CB 


LEU 


A 


254 


3.348 


11.521 


56.955 


1.00 


36.51 


665 


CG 


LEU 


A 


254 


4.707 


12.101 


57.346 


1.00 


36.86 


666 


CD1 


LEU 


A 


254 


5.142 


13.113 


56.297 


1.00 


35.66 


667 


CD2 


LEU 


A 


254 


4.620 


12.751 


58.719 


1.00 


36.85 


668 


N 


THR 


A 


255 


0.395 


10.745 


57.579 


1.00 


38.14 


669 


CA 


THR 


A 


255 


-0.950 


10.496 


57.077 


1.00 


39.08 


670 


C 


THR 


A 


255 


-0.928 


10.828 


55.588 


1.00 


39.76 


671 


O 


THR 


A 


255 


-0.075 


11.593 


55.139 


1.00 


39.15 


672 


CB 


THR 


A 


255 


-1.982 


11.397 


57.781 


1.00 


39.68 


675 


N


SER 


A 


256 


-1.851 


10.258 


54.817 


1.00 


40.24 


676 


CA 


SER 


A 


256 


-1.883 


10.527 


53.382 


1.00 


40.92 


677 


C 


SER 


A 


256 


-2.148 


12.010 


53.152 


1.00 


40.23 


678 


O 


SER 


A 


256 


-1.662 


12.599 


52.185 


1.00 


40.28 


679 


CB 


SER 


A 


256 


-2.968 


9.693 


52.690 


1.00 


41.53 


681 


N 


GLU 


A 


257 


-2.916 


12.610 


54.056 


1.00 


39.55 


682 


CA 


GLU 


A 


257 


-3.252 


14.024 


53.963 


1.00 


38.71 


683 


C 


GLU 


A 


257 


-1.999


14.889 


54.038 


1.00 


36.50 


684 


O


GLU 


A 


257 


-1.825 


15.810 


53.240 


1.00 


36.20 


685 


CB 


GLU 


A 


257 


-4.221 


14.400 


55.085 


1.00 


41.43 


686 


CG 


GLU 


A 


257 


-4.650 


15.853 


55.090 


1.00 


44.63 


687 


CD 


GLU 


A 


257 


-5.747 


16.121 


56.103 


1.00 


47.39 


688 


OE1 


GLU 


A 


257 


-6.879 


15.634 


55.896 


1.00 


48.76 


689 


OE2 


GLU 


A 


257 


-5.476 


16.810 


57.109 


1.00 


48.99 


690 


N 


ASP 


A 


258 


-1.132 


14.593 


55.001 


1.00 


34.59 


691


CA 


ASP 


A 


258 


0.111 


15.339 


55.159 


1.00 


33.04 


692 


C 


ASP 


A 


258 


1.064 


15.047 


54.002 


1.00 


32.48 


693 


O


ASP 


A 


258 


1.782 


15.934 


53.546 


1.00 


31.37 


694 


CB 


ASP 


A 


258 


0.784 


14.984 


56.488 


1.00 


34.07 


695 


CG 


ASP 


A 


258 


0.256 


15.809 


57.645 


1.00 


35.11 


696 


OD1 


ASP 


A 


258 


0.599 


15.501 


58.807 


1.00 


35.63 


697 


OD2 


ASP 


A 


258 


-0.493 


16.775 


57.386 


1.00 


34.86 


698 


N 


GLN 


A 


259 


1.072 


13.803 


53.532 


1.00 


31.90 


699 


CA 


GLN 


A 


259 


1.940 


13.433 


52.417 


1.00 


32.81 


700 


C 


GLN 


A 


259 


1.611 


14.272 


51.184 


1.00 


32.59 


701 


O


GLN 


A 


259 


2.505 


14.820 


50.534 


1.00 


32.51 


702 


CB 


GLN 


A 


259 


1.783 


11.946 


52.077 


1.00 


32.98 


703 


CG 


GLN 


A 


259 


2.217 


11.000 


53.181 


1.00 


34.94 


704 


CD 


GLN 


A 


259 


2.168 


9.547 


52.755


1.00 


37.19 


705 


OE1 


GLN 


A 


259 


2.322 


8.641 


53.576 


1.00 


39.55 


706 


NE2 


GLN 


A 


259 


1.958 


9.315 


51.466


1.00 


37.81 


707 


N 


ILE 


A 


260 


0.325 


14.375 


50.866 


1.00 


32.68 


708 


CA 


ILE 


A 


260 


-0.109 


15.147 


49.706 


1.00 


32.42 


709 


C 


ILE 


A 


260 


0.183 


16.634 


49.880 


1.00 


31.57 


710 


O


ILE 


A 


260 


0.588 


17.311 


48.933 


1.00 


30.43 


711 


CB 


ILE 


A 


260 


-1.619 


14.959 


49.445 


1.00 


33.97 


712 


CG1 


ILE 


A 


260 


-1.933


13.471 


49.277 


1.00 


34.59 


713 


CG2 


ILE 


A 


260 


-2.036 


15.731 


48.201 


1.00 


33.83


714


CD1 


ILE 


A 


260 


-1.156 


12.789 


48.165 


1.00 


36.85 


715 


N 


VAL 


A 


261 


-0.029 


17.146 


51.088 


1.00 


29.87 


716 


CA 


VAL 


A 


261 


0.244 


18.551 


51.358 


1.00 


28.91 


717 


C 


VAL 


A 


261 


1.717 


18.862 


51.097 


1.00 


27.91 


718 


O


VAL 


A 


261 


2.043 


19.856 


50.445 


1.00 


28.13 


719 


CB 


VAL 


A 


261 


-0.089 


18.923 


52.827 


1.00 


28.91 


722 


N 


LEU 


A 


262 


2.605 


18.011 


51.604 


1.00 


27.17 


723 


CA 


LEU 


A 


262 


4.039 


18.222 


51.423 


1.00 


25.81 


724 


C 


LEU 


A 


262 


4.461 


18.126 


49.955 


1.00 


25.46 


725 


O


LEU 


A 


262 


5.274 


18.921 


49.485 


1.00 


24.78


726 


CB 


LEU 


A 


262 


4.836 


17.219 


52.265 


1.00 


26.02 


727


CG 


LEU 


A 


262 


4.604 


17.278 


53.781 


1.00 


25.71 


728 


CD1 


LEU 


A 


262 


5.382 


16.162 


54.464 


1.00 


26.98 


729 


CD2 


LEU 


A 


262 


5.028 


18.634 


54.317 


1.00 


26.30 


730 


N 


LEU 


A 


263 


3.911 


17.155 


49.232 


1.00 


25.71 


731 


CA 


LEU 


A 


263 


4.244 


16.985 


47.818 


1.00 


26.15 


732 


C 


LEU 


A 


263 


3.763 


18.166 


46.974 


1.00 


26.42 


733 


O


LEU 


A 


263 


4.514 


18.702 


46.154 


1.00 


25.91 


734 


CB 


LEU 


A 


263 


3.633 


15.681 


47.283 


1.00 


27.20 


735 


CG 


LEU 


A 


263 


4.293 


14.376 


47.745 


1.00 


29.34 


736 


CD1 


LEU 


A 


263 


3.401 


13.197 


47.404 


1.00 


30.14 


737 


CD2 


LEU 


A 


263 


5.658 


14.223 


47.082 


1.00 


31.00 


738 


N 


LYS 


A 


264 


2.519 


18.585 


47.178 


1.00 


25.69 


739 


CA 


LYS 


A 


264 


1.987 


19.699 


46.405 


1.00 


26.59 


740 


C 


LYS 


A 


264 


2.709 


21.011 


46.655 


1.00 


26.97 


741 


O


LYS 


A 


264 


2.962 


21.767 


45.723 


1.00 


27.99 


742 


CB 


LYS 


A 


264 


0.496 


19.899 


46.688 


1.00 


29.36 


743 


CG 


LYS 


A 


264 


-0.417 


18.910 


45.994 


1.00 


31.84 


744 


CD 


LYS 


A 


264 


-1.862 


19.348 


46.156 


1.00 


35.05 


745 


CE 


LYS 


A 


264 


-2.822 


18.400 


45.468 


1.00 


37.95 


746 


NZ 


LYS 


A 


264 


-4.233 


18.872 


45.629 


1.00 


39.41 


747 


N 


SER 


A 


265 


3.047 


21.286 


47.908 


1.00 


26.64 


748 


CA 


SER 


A 


265 


3.712 


22.540 


48.227 


1.00 


27.75 


749 


C 


SER 


A 


265 


5.199 


22.591 


47.884 


1.00 


26.92 


750 


O


SER 


A 


265 


5.750 


23.676 


47.723 


1.00 


28.28 


751 


CB 


SER 


A 


265 


3.513 


22.881 


49.709 


1.00 


28.81 


753 


N 


SER 


A 


266 


5.847 


21.434 


47.757 


1.00 


25.17 


754 


CA 


SER 


A 


266 


7.277 


21.412 


47.449 


1.00 


23.91 


755 


C 


SER 


A 


266 


7.609 


21.035 


46.011 


1.00 


23.51 


756 


O 


SER 


A 


266 


8.749 


21.206 


45.572 


1.00 


23.30 


757 


CB 


SER 


A 


266 


8.001 


20.445 


48.385 


1.00 


24.45 


759 


N 


ALA 


A 


267 


6.619 


20.519 


45.285 


1.00 


22.67


760 


CA 


ALA 


A 


267 


6.801 


20.089 


43.898 


1.00


23.39 


761 


C 


ALA 


A 


267 


7.698


20.979 


43.040 


1.00 


23.51 


762 


O


ALA 


A 


267 


8.716 


20.517 


42.515 


1.00 


23.55 


763 


CB 


ALA 


A 


267 


5.436 


19.938 


43.217 


1.00 


24.51 


764 


N 


ILE 


A 


268 


7.330 


22.247 


42.883 


1.00 


22.01 


765 


CA 


ILE 


A 


268 


8.132 


23.135 


42.047


1.00


99 9^ 


766 


C 


ILE 


A 


268 


9.539 


23.374 


42.592 


1.00 


22.05 


767 


O


ILE 


A 


268 


10.494 


23.558 


41.828 


1.00 


20.90 


768 


CB 


ILE 


A 


268 


7.426 


24.496 


41.811 


1.00 


23.63 


772 


N 


GLU 


A 


269 


9.674 


23.352 


43.911 


1.00 


20.40 


773 


CA 


GLU 


A 


269 


10.979 


23.561 


44.529 


1.00 


20.63 


774


C 


GLU 


A 


269 


11.933 


22.402 


44.268 


1.00 


21.33 


775 


O


GLU 


A 


269 


13.109 


22.620 


43.976 


1.00 


20.99


776


CB


GLU


A


269


10.823


23.770


46.030


1.00


20.38


777


CG


GLU


A


269


10.206


25.110


46.396


1.00


22.10


778


CD


GLU


A


269


10.009


25.261


47.892


1.00


23.72


779


OE1


GLU


A


269


10.803


24.670


48.656


1.00


22.73


780


OE2


GLU


A


269


9.067


25.974


48.301


1.00


24.58


781


N


VAL


A


270


11.434


21.172


44.375


1.00


20.39


782


CA


VAL


A


270


12.279


20.006


44.143


1.00


20.83


783


C


VAL


A


270


12.644


19.911


42.670


1.00


20.52


784


O


VAL


A


270


13.734


19.458


42.318


1.00


20.87


785


CB


VAL


A


270


11.582


18.709


44.597


1.00


21.55


788


N


ILE


A


271


11.731


20.337


41.804


1.00


20.29


789


CA


ILE


A


271


12.010


20.318


40.376


1.00


20.71


790


C


ILE


A


271


13.145


21.300


40.099


1.00


20.86


791


O


ILE


A


271


14.083


20.990


39.361


1.00


20.78


792


CB


ILE


A


271


10.755


20.684


39.563


1.00


21.89


796


N


MET


A


272


13.076


22.481


40.701


1.00


21.17


797


CA


MET


A


272


14.147


23.446


40.500


1.00


21.57


798


C


MET


A


272


15.474


22.888


41.020


1.00


20.82


799


O


MET


A


272


16.513


23.064


40.384


1.00


22.20


800


CB


MET


A


272


13.800


24.770


41.183


1.00


22.31


804


N


LEU


A


273


15.442


22.204


42.163


1.00


21.17


21.28


24.40


1946 


CD2 


LEU


A


417 


1.766


14.470 


37.882 


1.00 


25.23 


1947 


N 


VAL


A


418


*U.OuU 


18.711 


36.500 


1.00 


24.66 


1948 


CA 


VAL 


A 


418


n 097 


20.080 


36.640 


1.00 


25.57 


1949 


C 


VAL 


A


418


-1.176


20.971 


37.111 


1.00 


26.33 


1950 


O 


VAL 


A


418


-1 nm 


21.814


37.991 


1.00 


27.09 


1951 


CB 


VAL


A


418


U.oO 1 


20.599 


35.297 


1.00 


25.14 


1954 


N 


LEU 


A


419


9 ^4 

-Z.Oo'r 


20.769 


36.530 


1.00 


26.33 


1955 


CA 


LEU 


A


419


^9R 


21.556 


36.902 


1.00 


27.78 


1956 


C 


LEU


A


419


Q OC<* 
"J.OU 1 


21.399


38.382 


1.00 


29.03 


1957 


O


LEU


A


419


-4.906


22.370 


39.052 


1.00 


30.30


1958 


CB 


LEU


A


419


-4 7?^ 


21.143 


36.051 


1.00 


28.60 


1962 


N


GLU


A


420


o . / oo 


20.184 


38.904 


1.00 


29.76 


1963 


CA 


GLU


A


420


-4 O'SR 


19.962 


40.307 


1.00 


31.06 


1964 


C 


GLU


A


420


Q 010 

O . \J 1 w 


20.514 


41.268 


1.00 


30.59 


1965 


O 


GLU 


A


420


Q OA4 


21.184 


42.245 


1.00 


30.30 


1966 


CB 


GLU 


A


420


-4 9^7 


18.478 


40.605 


1.00 


32.62 


1967 


CG 


GLU 


A


420


4 RQ7 


18.251


42.037 


1.00 


36.69 


1968 


CD 


GLU 


A


420


-4 9R7 


16.919 


42.598 


1.00 


38.47 


1969 


OE1 


GLU 


A


420


A R^1 


16.624


43.756 


1.00 


40.46 


1970 


OE2 


GLU 


A


420


^ ^R1 

-O.OD I 


16.171 


41.891 


1.00 


41.56 


1971 


N 


VAL 


A


421


1 744 


90 223 


40.992 


1.00 


31.25 


1972 


CA 


VAL 


A


421


-U.DDO 


90 


41.855 


1.00 


32.00 


1973 


C 


VAL 


A


421 


0 RAA 


99 1Q1 


41.960 


1.00 


32.63 


1974 


O 


VAL 


A


421 


-U.ooo 


99 794 


43.051 


1.00 


32.82 


1975 


CB 


VAL 


A 


421


O RQA 


90 0R2 


41.395 


1.00 


31.60 


1978 


N 


PHE 


A


422


0 fi70 


99 RQ0 


40.839 


1.00 


33.47 


1979 


CA 


PHE 


A


422 


n c;ai 

-U.Oh- I 


94 ?49 


40.857 


1.00 


34.95 


1980 


C 


PHE 


A


422


- t .ouu 


25.089 


40.872 


1.00 


35.99 


1981 


O 


PHE


A


422


-1 Q07 


26.284 


41.159 


1.00 


36.37 


1982 


CB 


PHE


A


422


O ^10 


24.794 


39.670 


1.00 


34.60 


1983 


CG 


PHE 


A


422


1 R7Q 


24.182


39.656 


1.00 


34.76 


1984 


CD1 


PHE 


A


422


9 OQ^ 


9^ 


38.592 


1.00 


34.88 


1985 


CD2 


PHE


A


422




24.369 


40.728 


1.00 


35.16 


1986 


CE1 


PHE


A


422


^ ^4R 


22.790 


38.597 


1.00 


34.92 


1987 


CE2 


PHE


A


422


^ R01 


23.774 


40.743 


1.00 


34.70 


1988 


CZ 


PHE


A


422


4.202 


22.982 


39.674 


1.00 


34.73 


1989 


N


GLY


A


423


-2.946 


24.378 


40.570 


1.00 


37.39 


1990 


CA 


GLY


A


423


-4.261 


24.993 


40.564 


1.00 


38.89 


1991 


C


GLY


A


423


-4.914 


24.907 


41.930 


1.00 


39.70 


1992 


O 


GLY


A


423


-5.857 


24.099 


42.083 


1.00 


40.52 


1993 




A 
















1994 


O2


VDX 


425 


17.029 


18.071 


34.819 


1.00 


21.73 


O


1995 


O3


VDX 


425 


4.489 


26.946 


35.054 


1.00 


24.67 


O 


1996 


C1 


VDX 


425 


14.139 


17.953 


35.755 


1.00 


20.80 


C 


1997 


C2 


VDX 


425 


14.879 


16.893 


34.895 


1.00 


21.02 


C 


1998 


C3 


VDX 


425 


15.992 


17.534 


33.962 


1.00 


21.41 


C 


1999 


C4 


VDX 


425 


15.368 


18.672 


33.049 


1.00 


21.29 


C 


2000 


C5 


VDX 


425 


14.622 


19.724 


33.864 


1.00 


21. 0C 


C 


2001


C6


VDX


425


14.797


21.12C


33.792


1.00


20.9E


C


2002


C7 


VDX 


425 


14.174 


22.286 


34.514


1.00


91 0*\ 


C


2003 


C8 


VDX 


425 


13.966 


23.488 


34.042


1.00


C. I .OH 


C


2004 


C9 


VDX 


425 


14.354 


23.927 


32.544 


1.00


21.77


C


2005 


C10 


VDX 


425 


13.602 


19.075 


34.828 


1.00


20.14




2006 


C11 


VDX 


425 


13.088 


24.490 


31.671 


1.00


91 fifi 
Z \ .DO 




2007 


C12 


VDX 


425 


12.147 


25.443 


32.564 


1.00


99 V\A 
ZZ.U** 


C


2008 


C13 


VDX 


425 


11.753 


24.897 


34.070 


1.00


ZZ. Ul 


C


2009 


C14 


VDX 


425 


13.148 


24.538 


34.777 


1.00


zl .oU 


C


2010 


C15 


VDX 


425 


12.661 


24.266 


36.350 


1.00 




C


2011 


C16 


VDX 


425 


11.429 


25.231 


36.497 


1.00 


22.39


C


2012 


C17 


VDX 


425 


11.276 


25.934 


35.106 


1.00 


22.31


C


2013 


C18 


VDX 


425 


10.769 


23.570 


33.779 


1.00 


21.50


C


2014 


C19 


VDX 


425 


12.291 


19.455 


34.852 


1.00 


20.77


C


2015 


C20 


VDX 


425 


9.849 


26.546 


34.726 


1.00


22.90


C


2016 


C21 


VDX 


425 


9.804 


27.956 


35.482 


1.00 


23.00 


C


2017 


C22 


VDX 


425 


8.575 


25.824 


35.268 


1.00


Zo.TO 


C


2018 


C23 


VDX 


425 


7.331 


26.060 


34.405 


1.00 


OO *70 

2.6.(6 


C


2019 


C24


VDX 


425 


6.152 


25.266 


34.672 


1.00 


24.3b 


C


2020 


C25 


VDX 


425 


4.775 


25.776


34.336 


1.00 


Ovi 7C 

Z4./ 0 


C


2021 


C26 


VDX 


425 


4.701 


26.010 


32.842 


1.00 


25.41


C


2022 


C27 


VDX 


425 


3.668 


24.730 


34.723 


1.00


on on 
Z0.39 


C


2023 


O1


VDX 


425 


13.119 


17.359 


36.620 


1.00 


20.68


O


2024 


O


HOH 


500 


14.347 


10.333 


30.796 


1.00 


24.33 


O


2025 


O


HOH 


501 


13.828 


12.782 


35.922 


1.00


21.46


O


2026 


O 


HOH 


502 


13.846 


14.468 


42.856 


1.00 


24.78


O


2027 


O 


HOH 


503 


19.132 


15.890 


40.266 


1.00


21.27


O


2028 


O 


HOH 


504 


15.013 


12.029 


41.977 


1.00 


22.69


O


2029 


O 


HOH 


505 


13.766 


10.118 


35.125 


1.00 


20.29 


O


2030 


O 


HOH 


506 


16.290 


13.157 


34.345 


1.00 


30.57


O


2031 


O 


HOH 


507 


5.938 


22.747 


23.179 


1.00 


24.25 


O


2032 


O


HOH 


508 


13.771 


7.592 


35.963 


1.00 


28.23 


O


2033 


O


HOH 


509 


12.348 


25.386 


50.763 


1.00 


30.93 


O


2034 


O


HOH 


510 


28.498 


23.703 


34.824 


1.00 


37.09 


O


2035 


O


HOH 


511 


26.394 


10.521 


64.086 


1.00 


on co 

3U.00 


O


2036 


O


HOH 


512 


20.573 


9.150 


38.613 


1.00 


30.36 


O


2037 


O


HOH 


513 


19.724 


30.629 


29.203 


1.00


35.40


O


2038 


O


HOH 


514 


4.372 


27.504 


42.595 


1.00 


31.46


O


2039 


O


HOH 


515 


2.808 


13.423 


33.286 


1.00 


30.93 


O


2040 


O


HOH 


516 


23.698 


20.154


43.135 


1.00 


37.92




2041 


O


HOH 


517 


11.325 


5.901 


37.588 


1.00 


30.12


O


2042 


O


HOH 


518 


0.885 


13.049 


59.537 


1.00


39.32


O


2043 


O


HOH 


519 


20.338 


11.515 


62.065 


1.00


A O 

OO. I 0 


O


2044 


O


HOH 


520 


8.913 


6.134 


53.451 


1.00


44.37


O


2045 


O


HOH 


521 


4.924 


23.321 


44.129 


1.00


^A 




2046 


O


HOH 


522 


16.547 


6.409 


36.375 


1.00


62.. f u 


O


2047 


O


HOH 


523 


8.896 


35.918 


45.789 


1.00 


A C 70 

40. (6 


O


2048 


O


HOH 


524 


26.192 


21.542 


43.420 


1.00


Zo.OD 


O


2049


O


HOH


525




32.214 


23.915 


1.00 


35.31 


O


2050 


O


HOH 


526 


9.488 


15.901 


22.976 


1.00 


29.33 


O


2051 


O


HOH 


527 


5.345 


31.465 


22.796 


1.00 


31.37 


O


2052 


O


HOH 


528 


6.982 


20.227 


51.589 


1.00


32.20 


O


2053 


O


HOH 


529 


4.642 


13.886 


30.953 


1.00 


31.71 


O


2054 


O


HOH 


530 


-3.764 


29.115 


25.550 


1.00 


37.63 


O


2055 


O


HOH 


531 


31.831 


9.097 


66.550 


1.00 


36.20


O


2056 


O


HOH 


532 


10.178 


6.595 


32.965


1.00


30.94 


O


2057


O


HOH 


533 


-1.561 


14.197 


34.245 


1.00 


33.20 


O


2058 


O


HOH 


534 


0.476 


12.154 


62.160 


1.00 


39.93 


O


2059 


O


HOH 


535 


25.970 


5.142 


53.011 


1.00 


47.31 


O


2060 


O


HOH 


536 


8.695 


5.045 


44.801 


1.00 


38.39 


O


2061 


O


HOH 


537 


22.396 


11.047 


39.112 


1.00 


40.45 


O


2062 


O


HOH 


538 


13.975 


29.983 


22.553 


1.00 


36.21 


O


2063 


O


HOH 


539 


-6.673 


18.195 


37.122 


1.00 


36.41 


O


2064 


O


HOH 


540 


15.926 


27.813 


55.197 


1.00 


43.43 


O


2065 


O


HOH 


541 


21.922 


29.786 


26.625 


1.00 


39.42 


O 


2066 


O


HOH 


542 


29.079 


22.924 


57.335 


1.00 


43.49 


O


2067 


O


HOH 


543 


-8.883 


26.986 


29.744 


1.00 


47.42 


O


2068 


O


HOH 


544 


-2.789 


31.232 


23.837 


1.00 


38.14 


O


2069 


O


HOH 


545 


15.578 


33.329 


45.128 


1.00 


39.44 


O 


2070 


O


HOH 


546 


20.810 


2.660 


42.920 


1.00 


51.44 


O 


2071 


O


HOH 


547 


27.448 


25.982 


58.310 


1.00 


43.04 


O 


2072 


O


HOH 


548 


21.987 


8.152 


64.287 


1.00 


43.15 


O 


2073 


O


HOH 


549 


14.435 


13.091 


64.840 


1.00 


35.87 


O 


2074 


O


HOH 


550 


1.276 


25.772 


21.944


1.00 


40.66 


O 


2075 


O


HOH 


551 


14.102 


6.513 


31.763 


1.00 


43.70 


O


2076 


O


HOH 


552 


11.990 


24.017 


53.147 


1.00 


45.62 


O


2077 


O


HOH 


553 


3.481 


24.236 


20.666 


1.00 


35.69 


O 


2078 


O


HOH 


554 


24.054 


13.110 


35.770 


1.00 


37.92 


O


2079 


O


HOH 


556 


6.857 


37.182 


44.351 


1.00 


49.60 


O


2080 


O


HOH 


557 


-8.644 


30.901 


30.925 


1.00 


41.21 


O


2081 


O


HOH 


558 


17.767 


33.571 


43.159 


1.00 


37.66 


O 


2082 


O


HOH 


559 


16.954 


26.537 


23.238 


1.00 


51.77 , 


o 


2083 


0 


HOH 


560 


27.386 


20.638 


40.959 


1.00 


37.25 


O 


2084 


o 


HOH 


561 


31.418 


1.0.182 


50.496 


1.00 


47.27 


O 


2085 


0 


HOH 


562 


4.082 


21.082 


20.610 


1.00 


37.94 


O 


2086 


0 


HOH 


563 


14.064 


10.706 


58.224 


1.00 


42.75 


O 


2087 


0 


HOH 


564 


23.415 


29.835 


49.803 


1.00 


45.77 


O 


2088 


0 


HOH 


565 


14.533 


11.393 


24.395 


1.00 


36.60 


O 


2089 


0 


HOH 


566 


-0.868 


36.798 


40.025 


1.00 


52.17 


O 


2090 


0 


HOH 


567 


2.865 


34.386 


33.570 


1.00 


42.56 


O 


2091 


0 


HOH 


568 


-4.893 


19.288 


30.751 


1.00 


44.30 


O 


2092 


0 


HOH 


569 


30.643 


14.674 


61.949 


1.00 


43.28 


O 


2093 


0 


HOH 


570 


22.702 


3.372 


47.417 


1.00 


36.93 


o 


2094 


0 


HOH 


571 


13.379 


35.172 


44.109 


1.00 


47.38 


O 


2095 


0 


HOH 


572 


-1.138 


20.698 


22.966 


1.00 


53.61 


O 


2096 


0 


HOH 


573 


25.589 


19.849 


33.401 


1.00 


52.13 


O 


2097 


0 


HOH 


574 


23.893 


13.360 


32.579 


1.00 


45.26 


0 


2098 


0 


HOH 


575 


-7.367 


18.485 


31.944 


1.00 


48.23 


0 


2099 


0 


HOH 


576 


2.430 


19.200 


65.790 


1.00 


45.13 


o 


2100 


0 


HOH 


577 


20.048 


32.028 


44.907 


1.00 


46.82 


0 


2101 


0 


HOH 


578 


20.286 


6.713 


37.519 


1.00 


43.08 


o 


2102 


0 


HOH 


579 


25.879 


5.448 


50.403 


1.00 


48.82 


u 


2103 


0 


HOH 


580 


24.905 


19.763 


39.659 


1.00 


45.39 


u 


2104 


0 


HOH 


581 


2.o41 




OR HR9 


1 .00 


50.76 


0 


2105 


0 


HOH 


582 


15.248 


.20.000 


60.506 


1.00 


44.08 


0 


2106 


0 


HOH 


583 


22.695 


7.038 


37.715 


1.00 


46.55 


0 


2107 


0 


HOH 


584 


11.915 


16.625 


66.479 


1.00 


52.58 


0 


2108 


0 


HOH 


585 


20.145 


35.730 


35.936 


1.00 


46.90 


0 


2109 


0 


HOH 


586 


10.735 


24.933 


16.684 


1.00 


46.64 


0 


2110 


0 


HOH 


587 


1.182 


9.495 


61.83C 


1.00 


55.88 


0 


2111 


0 


HOH 


58€ 


\ -3.993 


16.527 


51 .745 


> 1.00 


43.32 


o 
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2112 


0 


HOH 


589 


21.842 


29.919 


56.624 


1.00 


/lO 17 

4z.l / 


o 


2113  O  HOH  590    3.602   25.520   44.494  1.00  [illegible]  O
2114  O  HOH  591    1.198   23.984   44.777  1.00  [illegible]  O
2115  O  HOH  592   13.208   27.713   54.123  1.00  [illegible]  O
2116  O  HOH  593   27.958    7.530   50.434  1.00  [illegible]  O
2117  O  HOH  594   22.594    3.510   64.140  1.00  [illegible]  O
2118  O  HOH  595   30.412   22.979   36.623  1.00  71.37  O
2119  O  HOH  596   10.560   15.906   20.574  1.00  50.32  O
2120  O  HOH  597   26.021    3.241   64.667  1.00  49.85  O
2121  O  HOH  598   19.853    9.062   62.967  1.00  56.45  O
2122  O  HOH  599   12.462    3.992   52.363  1.00  42.46  O
2123  O  HOH  600    6.152   35.657   28.721  1.00  46.87  O
2124  O  HOH  601    7.626   29.983   53.085  1.00  51.73  O
2125  O  HOH  602   11.547   23.591   57.064  1.00  51.07  O
2126  O  HOH  603   24.407   19.393   31.035  1.00  53.85  O
2127  O  HOH  604   12.538   23.006   18.706  1.00  50.11  O
2128  O  HOH  605    1.839   16.469   66.997  1.00  49.40  O
2129  O  HOH  606    1.378   19.964   21.070  1.00  48.97  O
2130  O  HOH  607    5.895   26.935   51.419  1.00  53.95  O
2131  O  HOH  608   13.122   33.698   19.464  1.00  52.90  O
2132  O  HOH  609   27.040    8.636   44.102  1.00  44.22  O
2133  O  HOH  610   18.833   30.775   55.879  1.00  54.75  O
2134  O  HOH  611   34.509   17.720   47.771  1.00  42.84  O
2135  O  HOH  612   18.356   32.644   25.579  1.00  42.52  O
2136  O  HOH  613   -2.259   16.235   28.804  1.00  56.71  O
2137  O  HOH  614   16.400   38.404   21.700  1.00  46.19  O
2138  O  HOH  615    9.340   39.540   19.060  1.00  51.44  O
2139  O  HOH  616   20.026   35.074   32.855  1.00  47.06  O
2140  O  HOH  617   31.604    8.486   59.428  1.00  47.99  O
2141  O  HOH  618   26.228    8.975   40.708  1.00  47.20  O
2142  O  HOH  619    0.460   15.378   28.064  1.00  50.21  O
2143  O  HOH  620   15.771    3.385   48.139  1.00  38.09  O
2144  O  HOH  621   25.135   17.914   42.644  1.00  60.05  O
2145  O  HOH  622   -2.286   29.197   21.618  1.00  53.99  O
2146  O  HOH  623   32.865   18.926   45.658  1.00  48.11  O
2147  O  HOH  624   17.116   13.333   25.240  1.00  52.60  O
2148  O  HOH  625   -2.809   17.978   56.255  1.00  53.36  O
2149  O  HOH  626   -3.647    7.885   56.347  1.00  63.91  O
2150  O  HOH  627   17.746   24.596   21.608  1.00  59.81  O
2151  O  HOH  628   28.368    5.841   47.861  1.00  66.08  O
2152  O  HOH  629   13.641   11.618   66.858  1.00  52.02  O
2153  O  HOH  630    8.052   20.893   16.742  1.00  53.91  O
2154  O  HOH  631    8.914   38.015   27.578  1.00  56.47  O
2155  O  HOH  632    9.081   13.482   19.627  1.00  57.14  O
2156  O  HOH  633   -4.343   24.969   37.694  1.00  51.08  O
2157  O  HOH  634    3.597   28.859   46.576  1.00  54.80  O
2158  O  HOH  635   27.905   21.432   28.373  1.00  [illegible]  O
2159  O  HOH  636  [illegible]  [illegible]   25.491  1.00  47.50  O
2160  O  HOH  637   -2.808   23.046   51.839  1.00  49.04  O
2161  O  HOH  638    2.757   25.756   18.437  1.00  49.80  O
2162  O  HOH  639   15.470    7.390   63.803  1.00  52.42  O
2163  O  HOH  640   33.689   11.757   50.784  1.00  54.00  O
2164  O  HOH  641    6.223   13.352   20.927  1.00  49.77  O
2165  O  HOH  642   12.267   32.764   51.605  1.00  48.76  O
2166  O  HOH  644   25.211    3.585   48.391  1.00  49.75  O
2167  O  HOH  645    0.619   24.002   51.358  1.00  49.46  O


2168  O  HOH  646   12.270   22.627   60.617  1.00  63.88  O
2169  O  HOH  647    0.202   23.805   47.834  1.00  52.54  O
2170  O  HOH  648   15.471    8.169   23.816  1.00  54.49  O
2171  O  HOH  649    4.098   13.117   28.105  1.00  43.97  O
2172  O  HOH  650   16.032    4.857   59.064  1.00  55.67  O
2173  O  HOH  651   -5.591   11.911   55.960  1.00  63.35  O
2174  O  HOH  652   14.373    4.083   36.218  1.00  49.18  O
2175  O  HOH  653   11.138    5.501   59.825  1.00  51.19  O
2176  O  HOH  654   26.262    1.299   50.288  1.00  61.20  O
2177  O  HOH  655    4.067   20.751   67.111  1.00  51.75  O
2178  O  HOH  656   11.291   34.551   23.646  1.00  53.35  O
2179  O  HOH  657    2.505   33.743   45.342  1.00  58.29  O
2180  O  HOH  658   18.881   -0.886   43.452  1.00  [illegible]  O
2181  O  HOH  659   -1.930   13.191   62.255  1.00  65.05  O
2182  O  HOH  660   -3.587   12.153   34.625  1.00  51.24  O
2183  O  HOH  661   -2.064   26.008   58.110  1.00  58.94  O
2184  O  HOH  662   18.842   12.351   64.527  1.00  60.06  O
2185  O  HOH  663   30.991   26.420   51.105  1.00  54.69  O
2186  O  HOH  664   16.115   30.354   56.207  1.00  60.96  O
2187  O  HOH  665   36.596   19.242   55.988  1.00  55.83  O
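
The water entries of the coordinate table can be read programmatically. The following is a minimal sketch, assuming each entry is written as ten whitespace-separated fields in the order serial number, atom name, residue name, residue number, x, y, z, occupancy, B-factor, element. The names `WaterRecord` and `parse_record` are illustrative only and are not part of the specification.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass(frozen=True)
class WaterRecord:
    """One coordinate entry (hypothetical container, for illustration)."""
    serial: int
    atom: str
    residue: str
    residue_number: int
    x: float
    y: float
    z: float
    occupancy: float
    b_factor: float
    element: str


def parse_record(line: str) -> WaterRecord:
    """Split one whitespace-separated coordinate entry into typed fields."""
    f = line.split()
    if len(f) != 10:
        raise ValueError(f"expected 10 fields, got {len(f)}: {line!r}")
    return WaterRecord(
        int(f[0]), f[1], f[2], int(f[3]),
        float(f[4]), float(f[5]), float(f[6]),
        float(f[7]), float(f[8]), f[9],
    )


# Two entries taken from the table (waters 534 and 535).
rows = [
    "2058 O HOH 534 0.476 12.154 62.160 1.00 39.93 O",
    "2059 O HOH 535 25.970 5.142 53.011 1.00 47.31 O",
]
records = [parse_record(r) for r in rows]
mean_b = mean(r.b_factor for r in records)
print(f"{len(records)} waters, mean B-factor {mean_b:.2f}")
```

Entries whose fields are unreadable would raise `ValueError` in this sketch and would need to be skipped or handled separately.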



It will be understood that various details of the invention can be 
changed without departing from the scope of the invention. Furthermore, the 
foregoing description is for the purpose of illustration only, and not for the
purpose of limitation, the invention being defined by the claims.

CLAIMS

What is claimed is: 

1. A substantially pure PXR ligand binding domain polypeptide in
crystalline form.

2. The polypeptide of claim 1, wherein the crystalline form has 
lattice constants of a = 91.6 Å, b = 91.6 Å, c = 85.0 Å, α = 90°, β = 90°, γ =
90°.

3. The polypeptide of claim 1 or 2, wherein the crystalline form is a
tetragonal crystalline form.

4. The polypeptide of claim 1 or 2, wherein the crystalline form has 
a space group of P4₃2₁2.

5. The polypeptide of claim 1 or 2, wherein the PXR ligand binding 
domain polypeptide has the amino acid sequence shown in SEQ ID NO: 4. 

6. The polypeptide of claim 1 or 2, wherein the PXR ligand binding 
domain polypeptide is in complex with a ligand.

7. The polypeptide of claim 6, wherein the ligand is a drug. 

8. The polypeptide of claim 7, wherein the drug is SR12813. 

9. The polypeptide of claim 1 or 2, wherein the PXR ligand binding 
domain has a crystalline structure further characterized by the coordinates 
corresponding to Table 4. 

10. The polypeptide of claim 1 or 2, wherein the crystalline form
contains one PXR ligand binding domain polypeptide in the asymmetric unit.

11. The polypeptide of claim 1 or 2, wherein the crystalline form is
such that the three-dimensional structure of the crystallized hPXR ligand 
binding domain polypeptide can be determined to a resolution of about 2.75 Å
or better.

12. The polypeptide of claim 1 or 2, wherein the crystalline form 
contains one or more atoms having an atomic weight of 40 grams/mol or 
greater. 

13. A method for determining the three-dimensional structure of a
crystallized hPXR ligand binding domain polypeptide to a resolution of about
3.0 Å or better, the method comprising:

(a) crystallizing a hPXR ligand binding domain polypeptide; and

(b) analyzing the hPXR ligand binding domain polypeptide to
determine the three-dimensional structure of the crystallized
hPXR ligand binding domain polypeptide, whereby the
three-dimensional structure of a crystallized hPXR ligand binding
domain polypeptide is determined to a resolution of about 2.75 Å
or better.

14. The method of claim 13, wherein the analyzing is by X-ray 
diffraction. 

15. The method of claim 13, wherein the crystallization is
accomplished by the hanging drop vapor diffusion method, and wherein the
PXR ligand binding domain is mixed with an equal volume of reservoir.

16. The method of claim 15, wherein the reservoir comprises 250 
mM NaCl, 20 mM Tris-HCl pH 7.8, 5% glycerol (v/v), 5 mM DTT, 2.5 mM
EDTA.

17. The method of claim 15, wherein the reservoir comprises 50 mM imidazole,
pH 7.4, 10% 2-propanol (v/v).

18. A method of generating a crystallized PXR ligand binding 
domain polypeptide, the method comprising:

(a) incubating a solution comprising a PXR ligand binding domain
with an equal volume of reservoir; and

(b) crystallizing the PXR ligand binding domain polypeptide using
the hanging drop method, whereby a crystallized PXR ligand
binding domain polypeptide is generated.

19. A crystallized PXR ligand binding domain polypeptide produced 
by the method of claim 18. 

20. A method of designing a modulator of a PXR polypeptide, the
method comprising:

(a) designing a potential modulator of a PXR polypeptide that will
form bonds with amino acids in a ligand binding site based upon
a crystalline structure of a PXR ligand binding domain
polypeptide;

(b) synthesizing the modulator; and

(c) determining whether the potential modulator modulates the
activity of the PXR polypeptide, whereby a modulator of a PXR
polypeptide is designed.

21. A method of designing a modulator that selectively modulates 
the activity of a human PXR polypeptide, the method comprising: 

(a) obtaining a crystalline form of a human PXR ligand binding 
domain polypeptide; 

(b) evaluating the three-dimensional structure of the crystallized
human PXR ligand binding domain polypeptide; and

(c) synthesizing a potential modulator based on the
three-dimensional crystal structure of the crystallized human PXR
ligand binding domain polypeptide, whereby a modulator that
selectively modulates the activity of a human PXR polypeptide is
designed.

22. The method of claim 21, wherein the method further comprises
contacting a human PXR ligand binding domain polypeptide with the potential
modulator; and assaying the human PXR ligand binding domain polypeptide
for binding of the potential modulator, for a change in activity of the human
PXR ligand binding domain polypeptide, or both.

23. The method of claim 21, wherein the crystalline form is in 
tetragonal form. 

24. The method of claim 23, wherein the crystalline form is such that
the three-dimensional structure of the crystallized human PXR ligand binding
domain polypeptide can be determined to a resolution of about 2.75 Å or
better.

25. A method for identifying a PXR modulator, the method 
comprising: 

(a) providing atomic coordinates of a PXR ligand binding domain to 
a computerized modeling system; and

(b) modeling a ligand that fits spatially into a binding cavity or on the
surface of the PXR ligand binding domain, whereby a PXR
modulator is identified.

26. The method of claim 25, wherein the method further comprises 
identifying in an assay for PXR-mediated activity a modeled ligand that
increases or decreases the activity of the PXR.

27. A method of identifying a PXR modulator that selectively
modulates the activity of a PXR polypeptide compared to other polypeptides, 
the method comprising: 

(a) providing atomic coordinates of a PXR ligand binding domain to 
a computerized modeling system; and

(b) modeling a ligand that fits spatially into a binding cavity or on the
surface of a PXR ligand binding domain and that interacts with
conformationally constrained residues of a PXR that are
conserved among PXR orthologs and isoforms, whereby a PXR
modulator is identified.

28. The method of claim 27, wherein the method further comprises 
identifying in a biological assay for PXR-mediated activity a modeled ligand 
that selectively binds to the PXR ligand binding domain and increases or
decreases the activity of the PXR.

29. A method of designing a modulator of a PXR polypeptide, the 
method comprising: 

(a) selecting a candidate PXR ligand;

(b) determining which amino acid or amino acids of a PXR
polypeptide interact with the ligand using a three-dimensional
model of a crystallized protein comprising a PXR LBD;

(c) identifying in a biological assay for PXR activity a degree to
which the ligand modulates the activity of the PXR polypeptide;

(d) selecting a chemical modification of the ligand wherein the
interaction between the amino acids of the PXR polypeptide and
the ligand is predicted to be modulated by the chemical
modification;

(e) performing the chemical modification on the ligand to form a
modified ligand;

(f) contacting the modified ligand with the PXR polypeptide;

(g) identifying in a biological assay for PXR activity a degree to
which the modified ligand modulates the biological activity of the
PXR polypeptide; and

(h) comparing the biological activity of the PXR polypeptide in the
presence of modified ligand with the biological activity of the
PXR polypeptide in the presence of the unmodified ligand,
whereby a modulator of a PXR polypeptide is designed.

30. The method of claim 29, wherein the PXR polypeptide is a 
human PXR polypeptide.

31. The method of claim 29, wherein the three-dimensional model of
a crystallized protein is a human PXR LBD polypeptide with a bound ligand.

32. The method of claim 31, wherein the ligand is a drug.

33. The method of claim 32, wherein the drug is SR12813. 

34. The method of claim 29, wherein the method further comprises 
repeating steps (a) through (f), if the biological activity of the PXR polypeptide
in the presence of the modified ligand varies from the biological activity of the
PXR polypeptide in the presence of the unmodified ligand.

35. An assay method for identifying a compound that inhibits binding 
of a ligand to a PXR polypeptide, the assay method comprising:

(a) designing a test inhibitor compound capable of modulating PXR
activity, based on the atomic coordinates of a PXR ligand
binding domain;

(b) synthesizing the test inhibitor compound;

(c) incubating a PXR polypeptide with a ligand in the presence of a
test inhibitor compound;

(d) determining an amount of ligand that is bound to the PXR
polypeptide, wherein decreased binding of ligand to the PXR
protein in the presence of the test inhibitor compound relative to
binding of ligand in the absence of the test inhibitor compound is
indicative of inhibition; and

(e) identifying the test compound as an inhibitor of ligand binding if
decreased ligand binding is observed, whereby a compound that
inhibits binding of a ligand to a PXR polypeptide is identified.

36. The method of claim 35, wherein the ligand is a drug.

37. The method of claim 36, wherein the drug is SR12813.

38. A method of evaluating a candidate therapeutic agent in humans 
using a mouse model system, the method comprising: 

(a) providing atomic coordinates of a human PXR ligand binding
domain to a computerized modeling system;

(b) modeling a candidate therapeutic agent that fits spatially into a
binding cavity or on the surface of a human PXR ligand binding
domain;

(c) providing a mouse PXR polypeptide;

(d) selecting one or more mutations to be introduced into an amino
acid sequence of the mouse PXR polypeptide, the mutations
being selected so as to alter the mouse PXR polypeptide to be
similar to a human PXR polypeptide;

(e) providing a mutant mouse PXR polypeptide comprising the one
or more mutations selected in step (d);

(f) contacting the candidate therapeutic agent modeled in step (b)
with the mutant mouse PXR polypeptide;

(g) determining an effect of the candidate therapeutic agent on the
mutant mouse PXR polypeptide; and

(h) evaluating the potential of the candidate therapeutic agent for
use in humans based on the effect of the candidate therapeutic
agent on the mutant mouse PXR polypeptide, whereby the
candidate therapeutic agent in humans using a mouse model
system is evaluated.

39. The method of claim 38, further comprising repeating steps (a) 
through (h) a desired number of times.

40. The method of claim 38, wherein the one or more mutations is
selected from the group consisting of an arginine to leucine substitution at
residue 203 of the mouse PXR polypeptide, a proline to serine substitution at
residue 205 of the mouse PXR polypeptide, a glutamine to histidine
substitution at residue 404 of the mouse PXR polypeptide and a glutamine to
arginine substitution at residue 407 of the mouse PXR polypeptide.

41. The method of claim 38, wherein the mouse PXR polypeptide 
comprises a mouse PXR ligand binding domain (LBD).

42. The method of claim 38, wherein the mutant mouse PXR 
polypeptide is expressed by: 

(a) introducing the selected mutation into the mouse PXR DNA
sequence, thereby forming a mutant mouse PXR DNA
sequence;

(b) transforming cells with the mutant mouse PXR DNA sequence;
and

(c) expressing the mutant mouse PXR DNA sequence, thereby
forming a mutant mouse PXR polypeptide.

43. The method of claim 42, wherein the mutant mouse PXR 
polypeptide is expressed in cells.

44. The method of claim 43, wherein the cells are CV-1 cells.

45. The method of claim 42, wherein the mutant mouse PXR DNA
sequence is cotransformed with a reporter gene. 

46. The method of claim 45, wherein the reporter gene is an
XREM-CYP3A4-LUC reporter.

47. The method of claim 45, wherein the mouse PXR polypeptide 
comprises a mouse PXR ligand binding domain (LBD). 
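
The lattice constants recited in claim 2 fix the unit cell volume. The following is a minimal sketch of that arithmetic, assuming the general triclinic volume formula (which reduces to a·b·c when all three angles are 90°). The function names are illustrative and do not come from the specification.

```python
import math


def unit_cell_volume(a, b, c, alpha_deg=90.0, beta_deg=90.0, gamma_deg=90.0):
    """General unit-cell volume in cubic angstroms (lengths in angstroms)."""
    ca, cb, cg = (math.cos(math.radians(x))
                  for x in (alpha_deg, beta_deg, gamma_deg))
    root = 1.0 - ca * ca - cb * cb - cg * cg + 2.0 * ca * cb * cg
    return a * b * c * math.sqrt(root)


def has_tetragonal_metric(a, b, c, alpha_deg, beta_deg, gamma_deg, tol=1e-6):
    """True when a equals b and all angles are 90 degrees (c is unconstrained)."""
    angles_ok = all(abs(x - 90.0) < tol for x in (alpha_deg, beta_deg, gamma_deg))
    return abs(a - b) < tol and angles_ok


# Lattice constants from claim 2.
cell = (91.6, 91.6, 85.0, 90.0, 90.0, 90.0)
volume = unit_cell_volume(*cell)
tetragonal = has_tetragonal_metric(*cell)
print(f"unit cell volume: {volume:.1f} cubic angstroms, tetragonal metric: {tetragonal}")
```

The metric check only confirms that the cell dimensions are compatible with the tetragonal form of claim 3; it says nothing about the space group itself.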
[Figure legend: □ W223A.Y225A-SR; • WT-RIF; ■ W223A.Y225A-RIF]

<110> SmithKline Beecham Corporation
Redinbo, Matthew
Watkins, Ryan
Wisely, George 
Williams, Shawn 
Kliewer, Steven 



<120> Crystallized Human Xenobiotic Nuclear Receptor PXR/SXR
Ligand Binding Domain Polypeptide and Screening Methods Employing
Same



<130> PU4460 



<160> 13 



<170> PatentIn version 3.0



<210> 1 

<211> 2146 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (304)..(1608)



<300> 

<301> Lehmann et al.

<302> The human orphan nuclear receptor PXR is activated by
compounds that regulate CYP3A4 gene expression and cause drug
interactions

<303> Journal of Clinical Investigation 
<304> 102 
<305> 5 
<306> 1016-23 
<307> 1998 

<308> Swiss-Prot/O75469; GenBank/AF061056
<309> 1999-01-20 
<313> (1)..(2146) 



<400> 1 

tgaaatatag gtgagagaca agattgtctc atatccgggg aaatcataac ctatgactag 60 

gacgggaaga ggaagcactg cctttacttc agtgggaatc tcggcctcag cctgcaagcc 120 

aagtgttcac agtgagaaaa gcaagagaat aagctaatac tcctgtcctg aacaaggcag 180

cggctccttg gtaaagctac tccttgatcg atcctttgca ccggattgtt caaagtggac 240 

cccaggggag aagtcggagc aaagaactta ccaccaagca gtccaagagg cccagaagca 300 

aac ctg gag gtg aga ccc aaa gaa agc tgg aac cat gct gac ttt gta  348
    Leu Glu Val Arg Pro Lys Glu Ser Trp Asn His Ala Asp Phe Val
    1               5                   10                  15

cac tgt gag gac aca gag tct gtt cct gga aag ccc agt gtc aac gca  396
His Cys Glu Asp Thr Glu Ser Val Pro Gly Lys Pro Ser Val Asn Ala
                20                  25                  30

gat gag gaa gtc gga ggt ccc caa atc tgc cgt gta tgt ggg gac aag  444
Asp Glu Glu Val Gly Gly Pro Gln Ile Cys Arg Val Cys Gly Asp Lys
            35                  40                  45

gcc act ggc tat cac ttc aat gtc atg aca tgt gaa gga tgc aag ggc  492
Ala Thr Gly Tyr His Phe Asn Val Met Thr Cys Glu Gly Cys Lys Gly
        50                  55                  60

ttt ttc agg agg gcc atg aaa cgc aac gcc cgg ctg agg tgc ccc ttc  540
Phe Phe Arg Arg Ala Met Lys Arg Asn Ala Arg Leu Arg Cys Pro Phe
    65                  70                  75

cgg aag ggc gcc tgc gag atc acc cgg aag acc cgg cga cag tgc cag  588
Arg Lys Gly Ala Cys Glu Ile Thr Arg Lys Thr Arg Arg Gln Cys Gln
80                  85                  90                  95

gcc tgc cgc ctg cgc aag tgc ctg gag agc ggc atg aag aag gag atg  636
Ala Cys Arg Leu Arg Lys Cys Leu Glu Ser Gly Met Lys Lys Glu Met
                100                 105                 110

atc atg tcc gac gag gcc gtg gag gag agg cgg gcc ttg atc aag cgg  684
Ile Met Ser Asp Glu Ala Val Glu Glu Arg Arg Ala Leu Ile Lys Arg
            115                 120                 125

aag aaa agt gaa cgg aca ggg act cag cca ctg gga gtg cag ggg ctg  732
Lys Lys Ser Glu Arg Thr Gly Thr Gln Pro Leu Gly Val Gln Gly Leu
        130                 135                 140

aca gag gag cag cgg atg atg atc agg gag ctg atg gac gct cag atg  780
Thr Glu Glu Gln Arg Met Met Ile Arg Glu Leu Met Asp Ala Gln Met
    145                 150                 155

aaa acc ttt gac act acc ttc tcc cat ttc aag aat ttc cgg ctg cca  828
Lys Thr Phe Asp Thr Thr Phe Ser His Phe Lys Asn Phe Arg Leu Pro
160                 165                 170                 175

ggg gtg ctt agc agt ggc tgc gag ttg cca gag tct ctg cag gcc cca  876
Gly Val Leu Ser Ser Gly Cys Glu Leu Pro Glu Ser Leu Gln Ala Pro
                180                 185                 190

tcg agg gaa gaa gct gcc aag tgg agc cag gtc cgg aaa gat ctg tgc  924
Ser Arg Glu Glu Ala Ala Lys Trp Ser Gln Val Arg Lys Asp Leu Cys
            195                 200                 205

tct ttg aag gtc tct ctg cag ctg cgg ggg gag gat ggc agt gtc tgg  972
Ser Leu Lys Val Ser Leu Gln Leu Arg Gly Glu Asp Gly Ser Val Trp
        210                 215                 220

aac tac aaa ccc cca gcc gac agt ggc ggg aaa gag atc ttc tcc ctg  1020
Asn Tyr Lys Pro Pro Ala Asp Ser Gly Gly Lys Glu Ile Phe Ser Leu
    225                 230                 235

ctg ccc cac atg gct gac atg tca acc tac atg ttc aaa ggc atc atc  1068
Leu Pro His Met Ala Asp Met Ser Thr Tyr Met Phe Lys Gly Ile Ile
240                 245                 250                 255

agc ttt gcc aaa gtc atc tcc tac ttc agg gac ttg ccc atc gag gac  1116
Ser Phe Ala Lys Val Ile Ser Tyr Phe Arg Asp Leu Pro Ile Glu Asp
                260                 265                 270

cag atc tcc ctg ctg aag ggg gcc gct ttc gag ctg tgt caa ctg aga  1164
Gln Ile Ser Leu Leu Lys Gly Ala Ala Phe Glu Leu Cys Gln Leu Arg
            275                 280                 285

ttc aac aca gtg ttc aac gcg gag act gga acc tgg gag tgt ggc cgg  1212
Phe Asn Thr Val Phe Asn Ala Glu Thr Gly Thr Trp Glu Cys Gly Arg
        290                 295                 300

ctg tcc tac tgc ttg gaa gac act gca ggt ggc ttc cag caa ctt cta  1260
Leu Ser Tyr Cys Leu Glu Asp Thr Ala Gly Gly Phe Gln Gln Leu Leu
    305                 310                 315

ctg gag ccc atg ctg aaa ttc cac tac atg ctg aag aag ctg cag ctg  1308
Leu Glu Pro Met Leu Lys Phe His Tyr Met Leu Lys Lys Leu Gln Leu
320                 325                 330                 335

cat gag gag gag tat gtg ctg atg cag gcc atc tcc ctc ttc tcc cca  1356
His Glu Glu Glu Tyr Val Leu Met Gln Ala Ile Ser Leu Phe Ser Pro
                340                 345                 350

gac cgc cca ggt gtg ctg cag cac cgc gtg gtg gac cag ctg cag gag  1404
Asp Arg Pro Gly Val Leu Gln His Arg Val Val Asp Gln Leu Gln Glu
            355                 360                 365

caa ttc gcc att act ctg aag tcc tac att gaa tgc aat cgg ccc cag  1452
Gln Phe Ala Ile Thr Leu Lys Ser Tyr Ile Glu Cys Asn Arg Pro Gln
        370                 375                 380

cct gct cat agg ttc ttg ttc ctg aag atc atg gct atg ctc acc gag  1500
Pro Ala His Arg Phe Leu Phe Leu Lys Ile Met Ala Met Leu Thr Glu
    385                 390                 395

ctc cgc agc atc aat gct cag cac acc cag cgg ctg ctg cgc atc cag  1548
Leu Arg Ser Ile Asn Ala Gln His Thr Gln Arg Leu Leu Arg Ile Gln
400                 405                 410                 415

gac ata cac ccc ttt gct acg ccc ctc atg cag gag ttg ttc ggc atc  1596
Asp Ile His Pro Phe Ala Thr Pro Leu Met Gln Glu Leu Phe Gly Ile
                420                 425                 430

aca ggt agc tga gcggctgccc ttgggtgaca cctccgagag gcagccagac  1648
Thr Gly Ser

aatgccctgc tggcctgtct ccctagggaa ttcctgctat gacagctggc tagcattcct 1768

caggaaggac atgggtgccc cccaccccca gttcagtctg tagggagtga agccacagac 1828

tcttacgtgg agagtgcact gacctgtagg tcaggaccat cagagaggca aggttgccct 1888

ttccttttaa aaggccctgt ggtctgggga gaaatccctc agatcccact aaagtgtcaa 1948

ggtgtggaag ggaccaagcg accaaggata ggccatctgg ggtctatgcc cacataccca 2008

cgtttgttcg cttcctgagt cttttcattg ctacctctaa tagtcctgtc tcccacttcc 2068

cactcgttcc cctcctcttc cgagctgctt tgtgggctcc aggcctgtac tcatcggcag 2128

gtgcatgagt atctgtgg 2146
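
The numeric identifiers of the listing can be cross-checked with simple coordinate arithmetic. The sketch below, whose function names are illustrative assumptions, takes the CDS range (304)..(1608) given under <222> for SEQ ID NO: 1, derives the number of encoded residues, and computes where the codon for a given residue begins.

```python
def cds_summary(start, end):
    """Return (nucleotide length, codon count) for a 1-based inclusive CDS range."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length, length // 3


def codon_start(cds_start, residue):
    """1-based nucleotide position of the first base of the codon for `residue`."""
    return cds_start + 3 * (residue - 1)


# <222> range of SEQ ID NO: 1.
length, codons = cds_summary(304, 1608)
residues = codons - 1  # the last codon of the range (tga) is a stop codon

# The ligand binding domain fragment begins at Asp 205 of the full-length protein.
lbd_start = codon_start(304, 205)
lbd_end = codon_start(304, 434) + 2  # last base of the codon for Ser 434

print(f"CDS: {length} nt, {codons} codons, {residues} residues")
print(f"LBD fragment: ({lbd_start})..({lbd_end})")
```

Under these assumptions the residue count agrees with the <211> length of 434 given for SEQ ID NO: 2, and the fragment range agrees with the <313> entry (916)..(1605) given for SEQ ID NO: 3.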

<210> 2 

<211> 434 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Leu Glu Val Arg Pro Lys Glu Ser Trp Asn His Ala Asp Phe Val His
1               5                   10                  15

Cys Glu Asp Thr Glu Ser Val Pro Gly Lys Pro Ser Val Asn Ala Asp
            20                  25                  30

Glu Glu Val Gly Gly Pro Gln Ile Cys Arg Val Cys Gly Asp Lys Ala
        35                  40                  45

Thr Gly Tyr His Phe Asn Val Met Thr Cys Glu Gly Cys Lys Gly Phe
    50                  55                  60

Phe Arg Arg Ala Met Lys Arg Asn Ala Arg Leu Arg Cys Pro Phe Arg
65                  70                  75                  80

Lys Gly Ala Cys Glu Ile Thr Arg Lys Thr Arg Arg Gln Cys Gln Ala
                85                  90                  95

Cys Arg Leu Arg Lys Cys Leu Glu Ser Gly Met Lys Lys Glu Met Ile
            100                 105                 110

Met Ser Asp Glu Ala Val Glu Glu Arg Arg Ala Leu Ile Lys Arg Lys
        115                 120                 125

Lys Ser Glu Arg Thr Gly Thr Gln Pro Leu Gly Val Gln Gly Leu Thr
    130                 135                 140

Glu Glu Gln Arg Met Met Ile Arg Glu Leu Met Asp Ala Gln Met Lys
145                 150                 155                 160

Thr Phe Asp Thr Thr Phe Ser His Phe Lys Asn Phe Arg Leu Pro Gly
                165                 170                 175

Val Leu Ser Ser Gly Cys Glu Leu Pro Glu Ser Leu Gln Ala Pro Ser
            180                 185                 190

Arg Glu Glu Ala Ala Lys Trp Ser Gln Val Arg Lys Asp Leu Cys Ser
        195                 200                 205

Leu Lys Val Ser Leu Gln Leu Arg Gly Glu Asp Gly Ser Val Trp Asn
    210                 215                 220

Tyr Lys Pro Pro Ala Asp Ser Gly Gly Lys Glu Ile Phe Ser Leu Leu
225                 230                 235                 240

Pro His Met Ala Asp Met Ser Thr Tyr Met Phe Lys Gly Ile Ile Ser
                245                 250                 255

Phe Ala Lys Val Ile Ser Tyr Phe Arg Asp Leu Pro Ile Glu Asp Gln
            260                 265                 270

Ile Ser Leu Leu Lys Gly Ala Ala Phe Glu Leu Cys Gln Leu Arg Phe
        275                 280                 285

Asn Thr Val Phe Asn Ala Glu Thr Gly Thr Trp Glu Cys Gly Arg Leu
    290                 295                 300

Ser Tyr Cys Leu Glu Asp Thr Ala Gly Gly Phe Gln Gln Leu Leu Leu
305                 310                 315                 320

Glu Pro Met Leu Lys Phe His Tyr Met Leu Lys Lys Leu Gln Leu His
                325                 330                 335

Glu Glu Glu Tyr Val Leu Met Gln Ala Ile Ser Leu Phe Ser Pro Asp
            340                 345                 350

Arg Pro Gly Val Leu Gln His Arg Val Val Asp Gln Leu Gln Glu Gln
        355                 360                 365

Phe Ala Ile Thr Leu Lys Ser Tyr Ile Glu Cys Asn Arg Pro Gln Pro
    370                 375                 380

Ala His Arg Phe Leu Phe Leu Lys Ile Met Ala Met Leu Thr Glu Leu
385                 390                 395                 400

Arg Ser Ile Asn Ala Gln His Thr Gln Arg Leu Leu Arg Ile Gln Asp
                405                 410                 415

Ile His Pro Phe Ala Thr Pro Leu Met Gln Glu Leu Phe Gly Ile Thr
            420                 425                 430

Gly Ser

<210> 3 

<211> 690 

<212> DNA

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)..(690) 



<300> 

<301> Lehmann et al.

<302> The human orphan nuclear receptor PXR is activated by
compounds that regulate CYP3A4 gene expression and cause drug
interactions



<303> Journal of Clinical Investigation 

<304> 102 

<305> 5 

<306> 1016-23 

<307> 1998 

<308> Swiss-Prot/O75469; GenBank/AF061056
<309> 1999-01-20

WO 02/095652
PCT/US02/15701

<313> (916)..(1605)



<400> 3 

gat ctg tgc tct ttg aag gtc tct ctg cag ctg cgg ggg gag gat ggc  48
Asp Leu Cys Ser Leu Lys Val Ser Leu Gln Leu Arg Gly Glu Asp Gly
1               5                   10                  15

agt gtc tgg aac tac aaa ccc cca gcc gac agt ggc ggg aaa gag atc  96
Ser Val Trp Asn Tyr Lys Pro Pro Ala Asp Ser Gly Gly Lys Glu Ile
            20                  25                  30

ttc tcc ctg ctg ccc cac atg gct gac atg tca acc tac atg ttc aaa  144
Phe Ser Leu Leu Pro His Met Ala Asp Met Ser Thr Tyr Met Phe Lys
        35                  40                  45

ggc atc atc agc ttt gcc aaa gtc atc tcc tac ttc agg gac ttg ccc  192
Gly Ile Ile Ser Phe Ala Lys Val Ile Ser Tyr Phe Arg Asp Leu Pro
    50                  55                  60

atc gag gac cag atc tcc ctg ctg aag ggg gcc gct ttc gag ctg tgt  240
Ile Glu Asp Gln Ile Ser Leu Leu Lys Gly Ala Ala Phe Glu Leu Cys
65                  70                  75                  80

caa ctg aga ttc aac aca gtg ttc aac gcg gag act gga acc tgg gag  288
Gln Leu Arg Phe Asn Thr Val Phe Asn Ala Glu Thr Gly Thr Trp Glu
                85                  90                  95

tgt ggc cgg ctg tcc tac tgc ttg gaa gac act gca ggt ggc ttc cag  336
Cys Gly Arg Leu Ser Tyr Cys Leu Glu Asp Thr Ala Gly Gly Phe Gln
            100                 105                 110

caa ctt cta ctg gag ccc atg ctg aaa ttc cac tac atg ctg aag aag  384
Gln Leu Leu Leu Glu Pro Met Leu Lys Phe His Tyr Met Leu Lys Lys
        115                 120                 125

ctg cag ctg cat gag gag gag tat gtg ctg atg cag gcc atc tcc ctc  432
Leu Gln Leu His Glu Glu Glu Tyr Val Leu Met Gln Ala Ile Ser Leu
    130                 135                 140

ttc tcc cca gac cgc cca ggt gtg ctg cag cac cgc gtg gtg gac cag  480
Phe Ser Pro Asp Arg Pro Gly Val Leu Gln His Arg Val Val Asp Gln
145                 150                 155                 160

ctg cag gag caa ttc gcc att act ctg aag tcc tac att gaa tgc aat  528
Leu Gln Glu Gln Phe Ala Ile Thr Leu Lys Ser Tyr Ile Glu Cys Asn
                165                 170                 175

cgg ccc cag cct gct cat agg ttc ttg ttc ctg aag atc atg gct atg  576
Arg Pro Gln Pro Ala His Arg Phe Leu Phe Leu Lys Ile Met Ala Met
            180                 185                 190

ctc acc gag ctc cgc agc atc aat gct cag cac acc cag cgg ctg ctg  624
Leu Thr Glu Leu Arg Ser Ile Asn Ala Gln His Thr Gln Arg Leu Leu
        195                 200                 205

cgc atc cag gac ata cac ccc ttt gct acg ccc ctc atg cag gag ttg  672
Arg Ile Gln Asp Ile His Pro Phe Ala Thr Pro Leu Met Gln Glu Leu
    210                 215                 220

ttc ggc atc aca ggt agc  690
Phe Gly Ile Thr Gly Ser
225                 230



<210> 4 
<211> 230 
<212> PRT 



<213> Homo sapiens 



<400> 4 

Asp Leu Cys Ser Leu Lys Val Ser Leu Gln Leu Arg Gly Glu Asp Gly
1 5 10 15

Ser Val Trp Asn Tyr Lys Pro Pro Ala Asp Ser Gly Gly Lys Glu Ile
20 25 30

Phe Ser Leu Leu Pro His Met Ala Asp Met Ser Thr Tyr Met Phe Lys
35 40 45

Gly Ile Ile Ser Phe Ala Lys Val Ile Ser Tyr Phe Arg Asp Leu Pro
50 55 60

Ile Glu Asp Gln Ile Ser Leu Leu Lys Gly Ala Ala Phe Glu Leu Cys
65 70 75 80

Gln Leu Arg Phe Asn Thr Val Phe Asn Ala Glu Thr Gly Thr Trp Glu
85 90 95

Cys Gly Arg Leu Ser Tyr Cys Leu Glu Asp Thr Ala Gly Gly Phe Gln
100 105 110

Gln Leu Leu Leu Glu Pro Met Leu Lys Phe His Tyr Met Leu Lys Lys
115 120 125

Leu Gln Leu His Glu Glu Glu Tyr Val Leu Met Gln Ala Ile Ser Leu
130 135 140

Phe Ser Pro Asp Arg Pro Gly Val Leu Gln His Arg Val Val Asp Gln
145 150 155 160

Leu Gln Glu Gln Phe Ala Ile Thr Leu Lys Ser Tyr Ile Glu Cys Asn
165 170 175

Arg Pro Gln Pro Ala His Arg Phe Leu Phe Leu Lys Ile Met Ala Met
180 185 190

Leu Thr Glu Leu Arg Ser Ile Asn Ala Gln His Thr Gln Arg Leu Leu
195 200 205

Arg Ile Gln Asp Ile His Pro Phe Ala Thr Pro Leu Met Gln Glu Leu
210 215 220

Phe Gly Ile Thr Gly Ser
225 230



<210> 5 

<211> 13 

<212> PRT 

<213> Homo sapiens 

<400> 5 

Glu Asp Thr Ala Gly Gly Phe Gln Gln Leu Leu Leu Glu
1 5 10

<210> 6 

<211> 4604

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (116)..(1399)



<300> 

<301> Baker et al.

<302> Cloning and expression of full-length cDNA encoding
human vitamin D receptor

<303> Proc. Natl. Acad. Sci. U.S.A. 

<304> 85 

<305> 10 

<306> 3294-98 

<307> 1988 

<308> Swiss-Prot/11473; GenBank/J03258

<309> 1995-01-14

<313> (116)..(1399)



<400> 6

ggaacagctt gtccacccgc cggccggacc agaagccttt gggtctgaag tgtctgtgag 60

acctcacaga agagcacccc tgggctccac ttacctgccc cctgctcctt caggg atg 118
Met
1

gag gca atg gcg gcc agc act tcc ctg cct gac cct gga gac ttt gac 166
Glu Ala Met Ala Ala Ser Thr Ser Leu Pro Asp Pro Gly Asp Phe Asp
5 10 15

cgg aac gtg ccc cgg atc tgt ggg gtg tgt gga gac cga gcc act ggc 214
Arg Asn Val Pro Arg Ile Cys Gly Val Cys Gly Asp Arg Ala Thr Gly
20 25 30

ttt cac ttc aat gct atg acc tgt gaa ggc tgc aaa ggc ttc ttc agg 262
Phe His Phe Asn Ala Met Thr Cys Glu Gly Cys Lys Gly Phe Phe Arg
35 40 45

cga agc atg aag cgg aag gca cta ttc acc tgc ccc ttc aac ggg gac 310
Arg Ser Met Lys Arg Lys Ala Leu Phe Thr Cys Pro Phe Asn Gly Asp
50 55 60 65

tgc cgc atc acc aag gac aac cga cgc cac tgc cag gcc tgc cgg ctc 358
Cys Arg Ile Thr Lys Asp Asn Arg Arg His Cys Gln Ala Cys Arg Leu
70 75 80

aaa cgc tgt gtg gac atc ggc atg atg aag gag ttc att ctg aca gat 406
Lys Arg Cys Val Asp Ile Gly Met Met Lys Glu Phe Ile Leu Thr Asp
85 90 95

gag gaa gtg cag agg aag cgg gag atg atc ctg aag cgg aag gag gag 454
Glu Glu Val Gln Arg Lys Arg Glu Met Ile Leu Lys Arg Lys Glu Glu
100 105 110

gag gcc ttg aag gac agt ctg cgg ccc aag ctg tct gag gag cag cag 502
Glu Ala Leu Lys Asp Ser Leu Arg Pro Lys Leu Ser Glu Glu Gln Gln
115 120 125

cgc atc att gcc ata ctg ctg gac gcc cac cat aag acc tac gac ccc 550
Arg Ile Ile Ala Ile Leu Leu Asp Ala His His Lys Thr Tyr Asp Pro
130 135 140 145

acc tac tcc gac ttc tgc cag ttc cgg cct cca gtt cgt gtg aat gat 598
Thr Tyr Ser Asp Phe Cys Gln Phe Arg Pro Pro Val Arg Val Asn Asp
150 155 160

ggt gga ggg agc cat cct tcc agg ccc aac tcc aga cac act ccc agc 646
Gly Gly Gly Ser His Pro Ser Arg Pro Asn Ser Arg His Thr Pro Ser
165 170 175

ttc tct ggg gac tcc tcc tcc tcc tgc tca gat cac tgt atc acc tct 694
Phe Ser Gly Asp Ser Ser Ser Ser Cys Ser Asp His Cys Ile Thr Ser
180 185 190

tca gac atg atg gac tcg tcc agc ttc tcc aat ctg gat ctg agt gaa 742
Ser Asp Met Met Asp Ser Ser Ser Phe Ser Asn Leu Asp Leu Ser Glu
195 200 205

gaa gat tca gat gac cct tct gtg acc cta gag ctg tcc cag ctc tcc 790
Glu Asp Ser Asp Asp Pro Ser Val Thr Leu Glu Leu Ser Gln Leu Ser
210 215 220 225

atg ctg ccc cac ctg gct gac ctg gtc agt tac agc atc caa aag gtc 838
Met Leu Pro His Leu Ala Asp Leu Val Ser Tyr Ser Ile Gln Lys Val
230 235 240

att ggc ttt gct aag atg ata cca gga ttc aga gac ctc acc tct gag 886
Ile Gly Phe Ala Lys Met Ile Pro Gly Phe Arg Asp Leu Thr Ser Glu
245 250 255

gac cag atc gta ctg ctg aag tca agt gcc att gag gtc atc atg ttg 934
Asp Gln Ile Val Leu Leu Lys Ser Ser Ala Ile Glu Val Ile Met Leu
260 265 270

cgc tcc aat gag tcc ttc acc atg gac gac atg tcc tgg acc tgt ggc 982
Arg Ser Asn Glu Ser Phe Thr Met Asp Asp Met Ser Trp Thr Cys Gly
275 280 285

aac caa gac tac aag tac cgc gtc agt gac gtg acc aaa gcc gga cac 1030
Asn Gln Asp Tyr Lys Tyr Arg Val Ser Asp Val Thr Lys Ala Gly His
290 295 300 305

agc ctg gag ctg att gag ccc ctc atc aag ttc cag gtg gga ctg aag 1078
Ser Leu Glu Leu Ile Glu Pro Leu Ile Lys Phe Gln Val Gly Leu Lys
310 315 320

aag ctg aac ttg cat gag gag gag cat gtc ctg ctc atg gcc atc tgc 1126
Lys Leu Asn Leu His Glu Glu Glu His Val Leu Leu Met Ala Ile Cys
325 330 335

atc gtc tcc cca gat cgt cct ggg gtg cag gac gcc gcg ctg att gag 1174
Ile Val Ser Pro Asp Arg Pro Gly Val Gln Asp Ala Ala Leu Ile Glu
340 345 350

gcc atc cag gac cgc ctg tcc aac aca ctg cag acg tac atc cgc tgc 1222
Ala Ile Gln Asp Arg Leu Ser Asn Thr Leu Gln Thr Tyr Ile Arg Cys
355 360 365

cgc cac ccg ccc ccg ggc agc cac ctg ctc tat gcc aag atg atc cag 1270
Arg His Pro Pro Pro Gly Ser His Leu Leu Tyr Ala Lys Met Ile Gln
370 375 380 385

aag cta gcc gac ctg cgc agc ctc aat gag gag cac tcc aag cag tac 1318
Lys Leu Ala Asp Leu Arg Ser Leu Asn Glu Glu His Ser Lys Gln Tyr
390 395 400

cgc tgc ctc tcc ttc cag cct gag tgc agc atg aag cta acg ccc ctt 1366
Arg Cys Leu Ser Phe Gln Pro Glu Cys Ser Met Lys Leu Thr Pro Leu
405 410 415

gtg ctc gaa gtg ttt ggc aat gag atc tcc tga ctaggacagc ctgtgcggtg 1419
Val Leu Glu Val Phe Gly Asn Glu Ile Ser
420 425



cctgggtggg gctgctcctc cagggccacg tgccaggccc ggggctggcg gctactcagc 1479

agccctcctc acccgtctgg ggttcagccc ctcctctgcc acctccccta tccacccagc 1539

ccattctctc tcctgtccaa cctaacccct ttcctgcggg cttttccccg gtcccttgag 1599

acctcagcca tgaggagttg ctgtttgttt gacaaagaaa cccaagtggg ggcagagggc 1659

agaggctgga ggcaggcctt gcccagagat gcctccaccg ctgcctaagt ggctgctgac 1719

tgatgttgag ggaacagaca ggagaaatgc atccattcct cagggacaga gacacctgca 1779

cctcccccca ctgcaggccc cgcttgtcca gcgcctagtg gggtctccct ctcctgcctt 1839

actcacgata aataatcggc ccacagctcc caccccaccc ccttcagtgc ccaccaacat 1899

cccattgccc tggttatatt ctcacgggca gtagctgtgg tgaggtgggt tttcttccca 1959

tcactggagc accaggcacg aacccacctg ctgagagacc caaggaggaa aaacagacaa 2019

aaacagcctc acagaagaat atgacagctg tccctgtcac caagctcaca gttcctcgcc 2079

ctgggtctaa ggggttggtt gaggtggaag ccctccctcc acggatccat gtagcaggcc 2139

tgaattgtcc ccagtttgca gaaaagcacc tgccgacctc gtcctccccc tgccagtgcc 2199

ttacctcctg cccaggagag ccagccctcc ctgtcctcct cggatcaccg agagtagccg 2259

agagcctgct cccccacccc ctccccaggg gagagggtct ggagaagcag tgagccgcat 2319

cttctccatc tggcagggtg ggatggagga gaagaatttt cagaccccag cggctgagtc 2379 

atgatctccc tgccgcctca atgtggttgc aaggccgctg ttcaccacag ggctaagagc 2439 

taggctgccg csccccagag tgtgggaagg gagagcgggg cagtctcggg tggctagtca 2499 

gagagagtgt ttgggggttc cgtgatgtag ggtaaggtgc cttcttattc tcactccacc 2559 

acccaaaagt caaaaggtgc ctgtgaggca ggggcggagt gatacaactt caagtgcatg 2619 

ctctctgcag gtcgagccca gcccagctgg tgggaagcgt ctgtccgttt actccaaggt 2679 

gggtctttgt gagagtgagc tgtaggtgtg cgggaccggt acagaaaggc gttcttcgag 2739 

gtggatcaca gaggcttctt cagatcaatg cttgagtttg gaatcggccg cattccctga 2799 

gtcaccagga atgttaaagt cagtgggaac gtgactgccc caactcctgg aagctgtgtc 2859

cttgcacctg catccgtagt tccctgaaaa cccagagagg aatcagactt cacactgcaa 2919 

gagccttggt gtccacctgg ccccatgtct ctcagaattc ttcaggtgga aaaacatctg 2979 

aaagccacgt tccttactgc agaatagcat atatatcgct taatcttaaa tttattagat 3039 

atgagttgtt ttcagactca gactccattt gtattatagt ctaatataca gggtagcagg 3099 

taccactgat ttggagatat ttatgggggg agaacttaca ttgtgaaact tctgtacatt 3159 

aattattatt gctgttgtta ttttacaagg gtctagggag agacccttgt ttgattttag 3219 

ctgcagaact gtattggtcc agcttgctct tcagtgggag aaaaacactt gtaagttgct 3279

aaacgagtca atcccctcat tcaggaaaac tgacagagga gggcgtgact cacccaagcc 3339

atatataact agctagaagt gggccaggac aggccgggcg cggtggctca cgcctgtaat 3399 

cccagcagtt tgggaggtcg aggtaggtgg atcacctgag gtcgggagtt cgagaccaac 3459 

ctgaccaaca tggagaaacc ctgtctctat taaaaataca aaaaaaaaaa aaaaaaaaaa 3519 

tagccgggca tggtggcgca agcctgtaat cccagctact caggaggctg aggcagaaga 3579 

attgaaccca ggaggtggag gttgcagtga gctgagatcg tgccgttact ctccaacctg 3639 

gacaacaaga gcgaaactcc gtcttagaag tggaccagga caggaccaga ttttggagtc 3699 

atggtccggt gtccttttca ctacaccatg tttgagctca gacccccact ctcattcccc 3759 

aggtggctga cccagtccct gggggaagcc ctggatttca gaaagagcca agtctggatc 3819 

tgggaccctt tccttccttc cctggcttgt aactccacca agcccatcag aaggagaagg 3879 

aaggagactc acctctgcct caatgtgaat cagaccctac cccaccacga tgtgccctgg 3939 

ctgctgggct ctccacctca ggccttggat aatgctgttg cctcatctat aacatgcatt 3999 

tgtctttgta atgtcaccac cttcccagct ctccctctgg ccctgcttct tcggggaact 4059 

cctgaaatat cagttactca gccctgggcc ccaccaccta ggccactcct ccaaaggaag 4119 

tctaggagct gggaggaaaa gaaaagaggg gaaaatgagt ttttatgggg ctgaacgggg 4179 

agaaaaggtc atcatcgatt ctactttaga atgagagtgt gaaatagaca tttgtaaatg 4239 

taaaactttt aaggtatatc attataactg aaggagaagg tgccccaaaa tgcaagattt 4299 

tccacaagat tcccagagac aggaaaatcc tctggctggc taactggaag catgtaggag 4359 

aatccaagcg aggtcaacag agaaggcagg aatgtgtggc agatttagtg aaagctagag 4419 

atatggcagc gaaaggatgt aaacagtgcc tgctgaatga tttccaaaga gaaaaaaagt 4479

ttgccagaag tttgtcaagt caaccaatgt agaaagcttt gcttatggta ataaaaatgg 4539

ctcatactta tatagcactt actttgtttg caagtactgc tgtaaataaa tgctttatgc 4599

aaacc 4604



<210> 7 

<211> 427 

<212> PRT 

<213> Homo sapiens 

<400> 7 

Met Glu Ala Met Ala Ala Ser Thr Ser Leu Pro Asp Pro Gly Asp Phe
1 5 10 15

Asp Arg Asn Val Pro Arg Ile Cys Gly Val Cys Gly Asp Arg Ala Thr
20 25 30

Gly Phe His Phe Asn Ala Met Thr Cys Glu Gly Cys Lys Gly Phe Phe
35 40 45

Arg Arg Ser Met Lys Arg Lys Ala Leu Phe Thr Cys Pro Phe Asn Gly
50 55 60

Asp Cys Arg Ile Thr Lys Asp Asn Arg Arg His Cys Gln Ala Cys Arg
65 70 75 80

Leu Lys Arg Cys Val Asp Ile Gly Met Met Lys Glu Phe Ile Leu Thr
85 90 95

Asp Glu Glu Val Gln Arg Lys Arg Glu Met Ile Leu Lys Arg Lys Glu
100 105 110

Glu Glu Ala Leu Lys Asp Ser Leu Arg Pro Lys Leu Ser Glu Glu Gln
115 120 125

Gln Arg Ile Ile Ala Ile Leu Leu Asp Ala His His Lys Thr Tyr Asp
130 135 140

Pro Thr Tyr Ser Asp Phe Cys Gln Phe Arg Pro Pro Val Arg Val Asn
145 150 155 160

Asp Gly Gly Gly Ser His Pro Ser Arg Pro Asn Ser Arg His Thr Pro
165 170 175

Ser Phe Ser Gly Asp Ser Ser Ser Ser Cys Ser Asp His Cys Ile Thr
180 185 190

Ser Ser Asp Met Met Asp Ser Ser Ser Phe Ser Asn Leu Asp Leu Ser
195 200 205

Glu Glu Asp Ser Asp Asp Pro Ser Val Thr Leu Glu Leu Ser Gln Leu
210 215 220

Ser Met Leu Pro His Leu Ala Asp Leu Val Ser Tyr Ser Ile Gln Lys
225 230 235 240

Val Ile Gly Phe Ala Lys Met Ile Pro Gly Phe Arg Asp Leu Thr Ser
245 250 255

Glu Asp Gln Ile Val Leu Leu Lys Ser Ser Ala Ile Glu Val Ile Met
260 265 270

Leu Arg Ser Asn Glu Ser Phe Thr Met Asp Asp Met Ser Trp Thr Cys
275 280 285

Gly Asn Gln Asp Tyr Lys Tyr Arg Val Ser Asp Val Thr Lys Ala Gly
290 295 300

His Ser Leu Glu Leu Ile Glu Pro Leu Ile Lys Phe Gln Val Gly Leu
305 310 315 320

Lys Lys Leu Asn Leu His Glu Glu Glu His Val Leu Leu Met Ala Ile
325 330 335

Cys Ile Val Ser Pro Asp Arg Pro Gly Val Gln Asp Ala Ala Leu Ile
340 345 350

Glu Ala Ile Gln Asp Arg Leu Ser Asn Thr Leu Gln Thr Tyr Ile Arg
355 360 365

Cys Arg His Pro Pro Pro Gly Ser His Leu Leu Tyr Ala Lys Met Ile
370 375 380

Gln Lys Leu Ala Asp Leu Arg Ser Leu Asn Glu Glu His Ser Lys Gln
385 390 395 400

Tyr Arg Cys Leu Ser Phe Gln Pro Glu Cys Ser Met Lys Leu Thr Pro
405 410 415

Leu Val Leu Glu Val Phe Gly Asn Glu Ile Ser
420 425

<210> 8

<211> 708

<212> DNA

<213> Homo sapiens

<220>

<221> CDS

<222> (1)..(708)

<300>

<301> Baker et al.

<302> Cloning and expression of full-length cDNA encoding
human vitamin D receptor

<303> Proc. Natl. Acad. Sci. U.S.A.

<304> 85

<305> 10

<306> 3294-98

<307> 1988

<308> Swiss-Prot/11473; GenBank/J03258

<309> 1995-01-14

<313> (689)..(1396)

<400> 8

acc tct tca gac atg atg gac tcg tcc agc ttc tcc aat ctg gat ctg 48
Thr Ser Ser Asp Met Met Asp Ser Ser Ser Phe Ser Asn Leu Asp Leu
1 5 10 15

agt gaa gaa gat tca gat gac cct tct gtg acc cta gag ctg tcc cag 96
Ser Glu Glu Asp Ser Asp Asp Pro Ser Val Thr Leu Glu Leu Ser Gln
20 25 30

ctc tcc atg ctg ccc cac ctg gct gac ctg gtc agt tac agc atc caa 144
Leu Ser Met Leu Pro His Leu Ala Asp Leu Val Ser Tyr Ser Ile Gln
35 40 45

aag gtc att ggc ttt gct aag atg ata cca gga ttc aga gac ctc acc 192
Lys Val Ile Gly Phe Ala Lys Met Ile Pro Gly Phe Arg Asp Leu Thr
50 55 60

tct gag gac cag atc gta ctg ctg aag tca agt gcc att gag gtc atc 240
Ser Glu Asp Gln Ile Val Leu Leu Lys Ser Ser Ala Ile Glu Val Ile
65 70 75 80

atg ttg cgc tcc aat gag tcc ttc acc atg gac gac atg tcc tgg acc 288
Met Leu Arg Ser Asn Glu Ser Phe Thr Met Asp Asp Met Ser Trp Thr
85 90 95

tgt ggc aac caa gac tac aag tac cgc gtc agt gac gtg acc aaa gcc 336
Cys Gly Asn Gln Asp Tyr Lys Tyr Arg Val Ser Asp Val Thr Lys Ala
100 105 110

gga cac agc ctg gag ctg att gag ccc ctc atc aag ttc cag gtg gga 384
Gly His Ser Leu Glu Leu Ile Glu Pro Leu Ile Lys Phe Gln Val Gly
115 120 125

ctg aag aag ctg aac ttg cat gag gag gag cat gtc ctg ctc atg gcc 432
Leu Lys Lys Leu Asn Leu His Glu Glu Glu His Val Leu Leu Met Ala
130 135 140

atc tgc atc gtc tcc cca gat cgt cct ggg gtg cag gac gcc gcg ctg 480
Ile Cys Ile Val Ser Pro Asp Arg Pro Gly Val Gln Asp Ala Ala Leu
145 150 155 160

att gag gcc atc cag gac cgc ctg tcc aac aca ctg cag acg tac atc 528
Ile Glu Ala Ile Gln Asp Arg Leu Ser Asn Thr Leu Gln Thr Tyr Ile
165 170 175

cgc tgc cgc cac ccg ccc ccg ggc agc cac ctg ctc tat gcc aag atg 576
Arg Cys Arg His Pro Pro Pro Gly Ser His Leu Leu Tyr Ala Lys Met
180 185 190

atc cag aag cta gcc gac ctg cgc agc ctc aat gag gag cac tcc aag 624
Ile Gln Lys Leu Ala Asp Leu Arg Ser Leu Asn Glu Glu His Ser Lys
195 200 205

cag tac cgc tgc ctc tcc ttc cag cct gag tgc agc atg aag cta acg 672
Gln Tyr Arg Cys Leu Ser Phe Gln Pro Glu Cys Ser Met Lys Leu Thr
210 215 220

ccc ctt gtg ctc gaa gtg ttt ggc aat gag atc tcc 708
Pro Leu Val Leu Glu Val Phe Gly Asn Glu Ile Ser
225 230 235



<210> 9 

<211> 236

<212> PRT 

<213> Homo sapiens 



<400> 9 

Thr Ser Ser Asp Met Met Asp Ser Ser Ser Phe Ser Asn Leu Asp Leu
1 5 10 15

Ser Glu Glu Asp Ser Asp Asp Pro Ser Val Thr Leu Glu Leu Ser Gln
20 25 30

Leu Ser Met Leu Pro His Leu Ala Asp Leu Val Ser Tyr Ser Ile Gln
35 40 45

Lys Val Ile Gly Phe Ala Lys Met Ile Pro Gly Phe Arg Asp Leu Thr
50 55 60

Ser Glu Asp Gln Ile Val Leu Leu Lys Ser Ser Ala Ile Glu Val Ile
65 70 75 80

Met Leu Arg Ser Asn Glu Ser Phe Thr Met Asp Asp Met Ser Trp Thr
85 90 95

Cys Gly Asn Gln Asp Tyr Lys Tyr Arg Val Ser Asp Val Thr Lys Ala
100 105 110

Gly His Ser Leu Glu Leu Ile Glu Pro Leu Ile Lys Phe Gln Val Gly
115 120 125

Leu Lys Lys Leu Asn Leu His Glu Glu Glu His Val Leu Leu Met Ala
130 135 140

Ile Cys Ile Val Ser Pro Asp Arg Pro Gly Val Gln Asp Ala Ala Leu
145 150 155 160

Ile Glu Ala Ile Gln Asp Arg Leu Ser Asn Thr Leu Gln Thr Tyr Ile
165 170 175

Arg Cys Arg His Pro Pro Pro Gly Ser His Leu Leu Tyr Ala Lys Met
180 185 190

Ile Gln Lys Leu Ala Asp Leu Arg Ser Leu Asn Glu Glu His Ser Lys
195 200 205

Gln Tyr Arg Cys Leu Ser Phe Gln Pro Glu Cys Ser Met Lys Leu Thr
210 215 220

Pro Leu Val Leu Glu Val Phe Gly Asn Glu Ile Ser
225 230 235



<210> 10 
<211> 951 



<212> DNA

<213> Homo sapiens

<220>

<221> CDS

<222> (1)..(951)



<400> 10

atg aaa aaa ggt cat [...] 48
Met Lys Lys Gly His [...]
1 5 10 15

act cag cca ctg gga [...] 96
Thr Gln Pro Leu Gly [...]
20 25 30

atc agg gag ctg atg gac gct cag atg aaa acc ttt gac act acc ttc 144
Ile Arg Glu Leu Met Asp Ala Gln Met Lys Thr Phe Asp Thr Thr Phe
35 40 45

tcc cat ttc aag aat ttc cgg ctg cca ggg gtg ctt agc agt ggc tgc 192
Ser His Phe Lys Asn Phe Arg Leu Pro Gly Val Leu Ser Ser Gly Cys
50 55 60

gag ttg cca gag tct ctg cag gcc cca tcg agg gaa gaa gct gcc aag 240
Glu Leu Pro Glu Ser Leu Gln Ala Pro Ser Arg Glu Glu Ala Ala Lys
65 70 75 80

tgg agc cag gtc cgg aaa gat ctg tgc tct ttg aag gtc tct ctg cag 288
Trp Ser Gln Val Arg Lys Asp Leu Cys Ser Leu Lys Val Ser Leu Gln
85 90 95

ctg cgg ggg gag gat ggc agt gtc tgg aac tac aaa ccc cca gcc gac 336
Leu Arg Gly Glu Asp Gly Ser Val Trp Asn Tyr Lys Pro Pro Ala Asp
100 105 110

agt ggc ggg aaa gag atc ttc tcc ctg ctg ccc cac atg gct gac atg 384
Ser Gly Gly Lys Glu Ile Phe Ser Leu Leu Pro His Met Ala Asp Met
115 120 125

tca acc tac atg ttc aaa ggc atc atc agc ttt gcc aaa gtc atc tcc 432
Ser Thr Tyr Met Phe Lys Gly Ile Ile Ser Phe Ala Lys Val Ile Ser
130 135 140

tac ttc agg gac ttg ccc atc gag gac cag atc tcc ctg ctg aag ggg 480
Tyr Phe Arg Asp Leu Pro Ile Glu Asp Gln Ile Ser Leu Leu Lys Gly
145 150 155 160

gcc gct ttc gag ctg tgt caa ctg aga ttc aac aca gtg ttc aac gcg 528
Ala Ala Phe Glu Leu Cys Gln Leu Arg Phe Asn Thr Val Phe Asn Ala
165 170 175

gag act gga acc tgg gag tgt ggc cgg ctg tcc tac tgc ttg gaa gac 576
Glu Thr Gly Thr Trp Glu Cys Gly Arg Leu Ser Tyr Cys Leu Glu Asp
180 185 190

act gca ggt ggc ttc cag caa ctt cta ctg gag ccc atg ctg aaa ttc 624
Thr Ala Gly Gly Phe Gln Gln Leu Leu Leu Glu Pro Met Leu Lys Phe
195 200 205

cac tac atg ctg aag aag ctg cag ctg cat gag gag gag tat gtg ctg 672
His Tyr Met Leu Lys Lys Leu Gln Leu His Glu Glu Glu Tyr Val Leu
210 215 220

atg cag gcc atc tcc ctc ttc tcc cca gac cgc cca ggt gtg ctg cag 720
Met Gln Ala Ile Ser Leu Phe Ser Pro Asp Arg Pro Gly Val Leu Gln
225 230 235 240

cac cgc gtg gtg gac cag ctg cag gag caa ttc gcc att act ctg aag 768
His Arg Val Val Asp Gln Leu Gln Glu Gln Phe Ala Ile Thr Leu Lys
245 250 255

tcc tac att gaa tgc aat cgg ccc cag cct gct cat agg ttc ttg ttc 816
Ser Tyr Ile Glu Cys Asn Arg Pro Gln Pro Ala His Arg Phe Leu Phe
260 265 270

ctg aag atc atg gct atg ctc acc gag ctc cgc agc atc aat gct cag 864
Leu Lys Ile Met Ala Met Leu Thr Glu Leu Arg Ser Ile Asn Ala Gln
275 280 285

cac acc cag cgg ctg ctg cgc atc cag gac ata cac ccc ttt gct acg 912
His Thr Gln Arg Leu Leu Arg Ile Gln Asp Ile His Pro Phe Ala Thr
290 295 300

ccc ctc atg cag gag ttg ttc ggc atc aca ggt agc tga 951
Pro Leu Met Gln Glu Leu Phe Gly Ile Thr Gly Ser
305 310 315




<210> 11

<211> 316

<212> PRT

<213> Homo sapiens

<400> 11

Met Lys Lys Gly His His His His His His Gly Ser Glu Arg Thr Gly
1 5 10 15

Thr Gln Pro Leu Gly Val Gln Gly Leu Thr Glu Glu Gln Arg Met Met
20 25 30

Ile Arg Glu Leu Met Asp Ala Gln Met Lys Thr Phe Asp Thr Thr Phe
35 40 45

Ser His Phe Lys Asn Phe Arg Leu Pro Gly Val Leu Ser Ser Gly Cys
50 55 60

Glu Leu Pro Glu Ser Leu Gln Ala Pro Ser Arg Glu Glu Ala Ala Lys
65 70 75 80

Trp Ser Gln Val Arg Lys Asp Leu Cys Ser Leu Lys Val Ser Leu Gln
85 90 95

Leu Arg Gly Glu Asp Gly Ser Val Trp Asn Tyr Lys Pro Pro Ala Asp
100 105 110

Ser Gly Gly Lys Glu Ile Phe Ser Leu Leu Pro His Met Ala Asp Met
115 120 125

Ser Thr Tyr Met Phe Lys Gly Ile Ile Ser Phe Ala Lys Val Ile Ser
130 135 140

Tyr Phe Arg Asp Leu Pro Ile Glu Asp Gln Ile Ser Leu Leu Lys Gly
145 150 155 160

Ala Ala Phe Glu Leu Cys Gln Leu Arg Phe Asn Thr Val Phe Asn Ala
165 170 175

Glu Thr Gly Thr Trp Glu Cys Gly Arg Leu Ser Tyr Cys Leu Glu Asp
180 185 190

Thr Ala Gly Gly Phe Gln Gln Leu Leu Leu Glu Pro Met Leu Lys Phe
195 200 205

His Tyr Met Leu Lys Lys Leu Gln Leu His Glu Glu Glu Tyr Val Leu
210 215 220

Met Gln Ala Ile Ser Leu Phe Ser Pro Asp Arg Pro Gly Val Leu Gln
225 230 235 240

His Arg Val Val Asp Gln Leu Gln Glu Gln Phe Ala Ile Thr Leu Lys
245 250 255

Ser Tyr Ile Glu Cys Asn Arg Pro Gln Pro Ala His Arg Phe Leu Phe
260 265 270

Leu Lys Ile Met Ala Met Leu Thr Glu Leu Arg Ser Ile Asn Ala Gln
275 280 285

His Thr Gln Arg Leu Leu Arg Ile Gln Asp Ile His Pro Phe Ala Thr
290 295 300

Pro Leu Met Gln Glu Leu Phe Gly Ile Thr Gly Ser
305 310 315



<210> 12

<211> 276

<212> DNA

<213> Homo sapiens

<220>

<221> CDS

<222> (1)..(276)

<300>

<301> Takeshita et al.

<302> Molecular cloning and properties of a full-length
putative thyroid hormone receptor coactivator

<303> Endocrinology

<304> 137

<305> 8

<306> 3594-3597

<307> 1996

<308> GenBank/U59302

<309> 1996-08-08

<313> (623)..(710)



<400> 12 

atg aaa aaa gac agt aaa tac tct caa acc agt cac aaa cta gtg cag 48
Met Lys Lys Asp Ser Lys Tyr Ser Gln Thr Ser His Lys Leu Val Gln
1 5 10 15

ctt ttg aca aca act gcc gaa cag cag tta cgg cat gct gat ata gac 96
Leu Leu Thr Thr Thr Ala Glu Gln Gln Leu Arg His Ala Asp Ile Asp
20 25 30

aca agc tgc aaa gat gtc ctg tct tgc aca ggc act tcc aac tct gcc 144
Thr Ser Cys Lys Asp Val Leu Ser Cys Thr Gly Thr Ser Asn Ser Ala
35 40 45

tct gct aac tct tca gga ggt tct tgt ccc tct tct cat agc tca ttg 192
Ser Ala Asn Ser Ser Gly Gly Ser Cys Pro Ser Ser His Ser Ser Leu
50 55 60

aca gaa cgg cat aaa att cta cac cgg ctc tta cag gag ggt agc ccc 240
Thr Glu Arg His Lys Ile Leu His Arg Leu Leu Gln Glu Gly Ser Pro
65 70 75 80

tca gat atc acc act ttg tct gtc gag cct gat taa 276
Ser Asp Ile Thr Thr Leu Ser Val Glu Pro Asp
85 90



<210> 13

<211> 91

<212> PRT

<213> Homo sapiens

<400> 13

Met Lys Lys Asp Ser Lys Tyr Ser Gln Thr Ser His Lys Leu Val Gln
1 5 10 15

Leu Leu Thr Thr Thr Ala Glu Gln Gln Leu Arg His Ala Asp Ile Asp
20 25 30

Thr Ser Cys Lys Asp Val Leu Ser Cys Thr Gly Thr Ser Asn Ser Ala
35 40 45

Ser Ala Asn Ser Ser Gly Gly Ser Cys Pro Ser Ser His Ser Ser Leu
50 55 60

Thr Glu Arg His Lys Ile Leu His Arg Leu Leu Gln Glu Gly Ser Pro
65 70 75 80

Ser Asp Ile Thr Thr Leu Ser Val Glu Pro Asp
85 90
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The special technical feature for the inventions of Group X is an expression of the 
mutant mouse PXR polypeptide. 
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